Commit 8291b072 authored by Sven Lautenbach's avatar Sven Lautenbach
Browse files

updated data. added few studies for not including or last update post

parent eb37ed8e
This source diff could not be displayed because it is too large. You can view the blob instead.
......@@ -178,12 +178,14 @@ def run():
# TODO: this is currently broken! You need to provide as csv file manually.
new_data_df = get_trials_df_from_url()
# remove wrong case NCT04256395
studies_to_delete = ["NCT04256395", "NCT04226157", "ISRCTN51287266", "NCT03042143", "NCT03891420", "EUCTR2019-002688-89-ES"]
studies_to_delete = ["NCT04256395", "NCT04226157", "ISRCTN51287266", "NCT03042143", "NCT03891420",
"EUCTR2019-002688-89-ES", "EUCTR2017-001100-30-DE"]
for study in studies_to_delete:
new_data_df = new_data_df.drop(new_data_df[new_data_df['TrialID'] == study].index, axis=0)
# for some studies that have been running for a while and have now been updated to incorporate COVID-19
# as well the suggestions by Markus Reis and Konstantin is to use the date in "Last Update Post" instead
studies_updated = ["EUCTR2015-002340-14-NL", "NCT03331445", "NCT04061382", "NCT03680274", "NCT03808922", "NCT03042143"]
studies_updated = ["EUCTR2015-002340-14-NL", "NCT03331445", "NCT04061382", "NCT03680274", "NCT03808922", "NCT03042143"
, "NCT04092478", "EUCTR2018-004318-16-DK"]
needsUpdate = new_data_df['TrialID'].isin(studies_updated)
new_data_df["Date registration"] = np.where(needsUpdate,
new_data_df["Last Refreshed on"],
......
This source diff could not be displayed because it is too large. You can view the blob instead.
This source diff could not be displayed because it is too large. You can view the blob instead.
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment