Skip to content

Part 4 Local

Instead of using the CLI/T-Tool/ETL-Tool. The process can be run with a custom python script to give more control over the inputs/outputs and various configurations

import carrot
import glob
inputs =  carrot.tools.load_csv(glob.glob('../data/part2/*'))
inputs
2022-06-17 14:49:16 - LocalDataCollection - INFO - DataCollection Object Created
2022-06-17 14:49:16 - LocalDataCollection - INFO - Registering  Blood_Test.csv [<carrot.io.common.DataBrick object at 0x106aba280>]
2022-06-17 14:49:16 - LocalDataCollection - INFO - Registering  Demographics.csv [<carrot.io.common.DataBrick object at 0x106af5ca0>]
2022-06-17 14:49:16 - LocalDataCollection - INFO - Registering  GP_Records.csv [<carrot.io.common.DataBrick object at 0x10a7e3fd0>]
2022-06-17 14:49:16 - LocalDataCollection - INFO - Registering  Hospital_Visit.csv [<carrot.io.common.DataBrick object at 0x10a7e3fa0>]
2022-06-17 14:49:16 - LocalDataCollection - INFO - Registering  Serology.csv [<carrot.io.common.DataBrick object at 0x106af5430>]
2022-06-17 14:49:16 - LocalDataCollection - INFO - Registering  Symptoms.csv [<carrot.io.common.DataBrick object at 0x106af5580>]
2022-06-17 14:49:16 - LocalDataCollection - INFO - Registering  Vaccinations.csv [<carrot.io.common.DataBrick object at 0x10a8476d0>]
2022-06-17 14:49:16 - LocalDataCollection - INFO - Registering  pks.csv [<carrot.io.common.DataBrick object at 0x10a8476a0>]

<carrot.io.plugins.local.LocalDataCollection at 0x106af5d60>
inputs.keys()
dict_keys(['Blood_Test.csv', 'Demographics.csv', 'GP_Records.csv', 'Hospital_Visit.csv', 'Serology.csv', 'Symptoms.csv', 'Vaccinations.csv', 'pks.csv'])
outputs = carrot.tools.create_csv_store(output_folder="./test_outputs/local/")
outputs
2022-06-17 14:49:16 - LocalDataCollection - INFO - DataCollection Object Created

<carrot.io.plugins.local.LocalDataCollection at 0x10a847a00>
rules = carrot.tools.load_json("../data/rules.json")
cdm = carrot.cdm.CommonDataModel.from_rules(rules,inputs=inputs,outputs=outputs)
cdm.get_tables()
2022-06-17 14:49:16 - CommonDataModel - INFO - CommonDataModel (5.3.1) created with co-connect-tools version 0.0.0
2022-06-17 14:49:16 - CommonDataModel - INFO - Running with an DataCollection object
2022-06-17 14:49:16 - CommonDataModel - INFO - Turning on automatic cdm column filling
2022-06-17 14:49:16 - CommonDataModel - INFO - Added MALE 3025 of type person
2022-06-17 14:49:16 - CommonDataModel - INFO - Added FEMALE 3026 of type person
2022-06-17 14:49:16 - CommonDataModel - INFO - Added Antibody 3027 of type observation
2022-06-17 14:49:16 - CommonDataModel - INFO - Added H/O: heart failure 3043 of type observation
2022-06-17 14:49:16 - CommonDataModel - INFO - Added 2019-nCoV 3044 of type observation
2022-06-17 14:49:16 - CommonDataModel - INFO - Added Cancer 3045 of type observation
2022-06-17 14:49:16 - CommonDataModel - INFO - Added Headache 3028 of type condition_occurrence
2022-06-17 14:49:16 - CommonDataModel - INFO - Added Fatigue 3029 of type condition_occurrence
2022-06-17 14:49:16 - CommonDataModel - INFO - Added Dizziness 3030 of type condition_occurrence
2022-06-17 14:49:16 - CommonDataModel - INFO - Added Cough 3031 of type condition_occurrence
2022-06-17 14:49:16 - CommonDataModel - INFO - Added Fever 3032 of type condition_occurrence
2022-06-17 14:49:16 - CommonDataModel - INFO - Added Muscle pain 3033 of type condition_occurrence
2022-06-17 14:49:16 - CommonDataModel - INFO - Added Pneumonia 3042 of type condition_occurrence
2022-06-17 14:49:16 - CommonDataModel - INFO - Added Mental health problem 3046 of type condition_occurrence
2022-06-17 14:49:16 - CommonDataModel - INFO - Added Mental disorder 3047 of type condition_occurrence
2022-06-17 14:49:16 - CommonDataModel - INFO - Added Type 2 diabetes mellitus 3048 of type condition_occurrence
2022-06-17 14:49:16 - CommonDataModel - INFO - Added Ischemic heart disease 3049 of type condition_occurrence
2022-06-17 14:49:16 - CommonDataModel - INFO - Added Hypertensive disorder 3050 of type condition_occurrence
2022-06-17 14:49:16 - CommonDataModel - INFO - Added COVID-19 vaccine 3034 of type drug_exposure
2022-06-17 14:49:16 - CommonDataModel - INFO - Added COVID-19 vaccine 3035 of type drug_exposure
2022-06-17 14:49:16 - CommonDataModel - INFO - Added COVID-19 vaccine 3036 of type drug_exposure
2022-06-17 14:49:16 - CommonDataModel - INFO - Added SARS-CoV-2 (COVID-19) vaccine, mRNA-1273 0.2 MG/ML Injectable Suspension 3040 of type drug_exposure
2022-06-17 14:49:16 - CommonDataModel - INFO - Added SARS-CoV-2 (COVID-19) vaccine, mRNA-BNT162b2 0.1 MG/ML Injectable Suspension 3041 of type drug_exposure

['person', 'observation', 'condition_occurrence', 'drug_exposure']
cdm.get_objects()
{'person': {'MALE 3025': <carrot.cdm.objects.versions.v5_3_1.person.Person at 0x106af5700>,
  'FEMALE 3026': <carrot.cdm.objects.versions.v5_3_1.person.Person at 0x106af5850>},
 'observation': {'Antibody 3027': <carrot.cdm.objects.versions.v5_3_1.observation.Observation at 0x10a8a6ca0>,
  'H/O: heart failure 3043': <carrot.cdm.objects.versions.v5_3_1.observation.Observation at 0x10a8aa730>,
  '2019-nCoV 3044': <carrot.cdm.objects.versions.v5_3_1.observation.Observation at 0x10a8a6d30>,
  'Cancer 3045': <carrot.cdm.objects.versions.v5_3_1.observation.Observation at 0x10a8aab80>},
 'condition_occurrence': {'Headache 3028': <carrot.cdm.objects.versions.v5_3_1.condition_occurrence.ConditionOccurrence at 0x10a8aab20>,
  'Fatigue 3029': <carrot.cdm.objects.versions.v5_3_1.condition_occurrence.ConditionOccurrence at 0x10a8ad880>,
  'Dizziness 3030': <carrot.cdm.objects.versions.v5_3_1.condition_occurrence.ConditionOccurrence at 0x10a8adf70>,
  'Cough 3031': <carrot.cdm.objects.versions.v5_3_1.condition_occurrence.ConditionOccurrence at 0x10a8ad280>,
  'Fever 3032': <carrot.cdm.objects.versions.v5_3_1.condition_occurrence.ConditionOccurrence at 0x10a8b3dc0>,
  'Muscle pain 3033': <carrot.cdm.objects.versions.v5_3_1.condition_occurrence.ConditionOccurrence at 0x106af5190>,
  'Pneumonia 3042': <carrot.cdm.objects.versions.v5_3_1.condition_occurrence.ConditionOccurrence at 0x10a8b3d00>,
  'Mental health problem 3046': <carrot.cdm.objects.versions.v5_3_1.condition_occurrence.ConditionOccurrence at 0x10a8b8f10>,
  'Mental disorder 3047': <carrot.cdm.objects.versions.v5_3_1.condition_occurrence.ConditionOccurrence at 0x10a8b8b20>,
  'Type 2 diabetes mellitus 3048': <carrot.cdm.objects.versions.v5_3_1.condition_occurrence.ConditionOccurrence at 0x10a8bcdf0>,
  'Ischemic heart disease 3049': <carrot.cdm.objects.versions.v5_3_1.condition_occurrence.ConditionOccurrence at 0x10a8b8bb0>,
  'Hypertensive disorder 3050': <carrot.cdm.objects.versions.v5_3_1.condition_occurrence.ConditionOccurrence at 0x10a8bca00>},
 'drug_exposure': {'COVID-19 vaccine 3034': <carrot.cdm.objects.versions.v5_3_1.drug_exposure.DrugExposure at 0x10a8c14c0>,
  'COVID-19 vaccine 3035': <carrot.cdm.objects.versions.v5_3_1.drug_exposure.DrugExposure at 0x10a8c1fd0>,
  'COVID-19 vaccine 3036': <carrot.cdm.objects.versions.v5_3_1.drug_exposure.DrugExposure at 0x10a8c1910>,
  'SARS-CoV-2 (COVID-19) vaccine, mRNA-1273 0.2 MG/ML Injectable Suspension 3040': <carrot.cdm.objects.versions.v5_3_1.drug_exposure.DrugExposure at 0x10a8c1850>,
  'SARS-CoV-2 (COVID-19) vaccine, mRNA-BNT162b2 0.1 MG/ML Injectable Suspension 3041': <carrot.cdm.objects.versions.v5_3_1.drug_exposure.DrugExposure at 0x10a8c5d60>}}
cdm.process()
2022-06-17 14:49:17 - CommonDataModel - INFO - Starting processing in order: ['person', 'observation', 'condition_occurrence', 'drug_exposure']
2022-06-17 14:49:17 - CommonDataModel - INFO - Number of objects to process for each table...
{
      "person": 2,
      "observation": 4,
      "condition_occurrence": 12,
      "drug_exposure": 5
}
2022-06-17 14:49:17 - CommonDataModel - INFO - for person: found 2 objects
2022-06-17 14:49:17 - CommonDataModel - INFO - working on person
2022-06-17 14:49:17 - CommonDataModel - INFO - starting on MALE 3025
2022-06-17 14:49:17 - Person - INFO - Called apply_rules
2022-06-17 14:49:17 - LocalDataCollection - INFO - Retrieving initial dataframe for 'Demographics.csv' for the first time
2022-06-17 14:49:17 - Person - INFO - Mapped birth_datetime
2022-06-17 14:49:17 - Person - INFO - Mapped gender_concept_id
2022-06-17 14:49:17 - Person - INFO - Mapped gender_source_concept_id
2022-06-17 14:49:17 - Person - INFO - Mapped gender_source_value
2022-06-17 14:49:17 - Person - INFO - Mapped person_id
2022-06-17 14:49:17 - Person - WARNING - Requiring non-null values in gender_concept_id removed 216 rows, leaving 284 rows.
2022-06-17 14:49:17 - Person - INFO - Automatically formatting data columns.
2022-06-17 14:49:17 - Person - INFO - created df (0x10a8f3c70)[MALE_3025]
2022-06-17 14:49:17 - CommonDataModel - INFO - finished MALE 3025 (0x10a8f3c70) ... 1/2 completed, 284 rows
2022-06-17 14:49:17 - LocalDataCollection - INFO - making output folder ./test_outputs/local/
2022-06-17 14:49:17 - LocalDataCollection - INFO - saving person_ids to ./test_outputs/local//person_ids.csv
2022-06-17 14:49:17 - LocalDataCollection - INFO - finished save to file
2022-06-17 14:49:17 - CommonDataModel - INFO - starting on FEMALE 3026
2022-06-17 14:49:17 - Person - INFO - Called apply_rules
2022-06-17 14:49:17 - Person - INFO - Mapped birth_datetime
2022-06-17 14:49:17 - Person - INFO - Mapped gender_concept_id
2022-06-17 14:49:17 - Person - INFO - Mapped gender_source_concept_id
2022-06-17 14:49:17 - Person - INFO - Mapped gender_source_value
2022-06-17 14:49:17 - Person - INFO - Mapped person_id
2022-06-17 14:49:17 - Person - WARNING - Requiring non-null values in gender_concept_id removed 286 rows, leaving 214 rows.
2022-06-17 14:49:17 - Person - INFO - Automatically formatting data columns.
2022-06-17 14:49:17 - Person - INFO - created df (0x10a919bb0)[FEMALE_3026]
2022-06-17 14:49:17 - CommonDataModel - INFO - finished FEMALE 3026 (0x10a919bb0) ... 2/2 completed, 214 rows
2022-06-17 14:49:17 - LocalDataCollection - INFO - updating person_ids in ./test_outputs/local//person_ids.csv
2022-06-17 14:49:17 - LocalDataCollection - INFO - finished save to file
2022-06-17 14:49:17 - CommonDataModel - INFO - saving dataframe (0x10a919be0) to <carrot.io.plugins.local.LocalDataCollection object at 0x10a847a00>
2022-06-17 14:49:17 - LocalDataCollection - INFO - saving person to ./test_outputs/local//person.csv
2022-06-17 14:49:17 - LocalDataCollection - INFO - finished save to file
2022-06-17 14:49:17 - CommonDataModel - INFO - finalised person on iteration 0 producing 498 rows from 2 tables
2022-06-17 14:49:17 - LocalDataCollection - INFO - Getting next chunk of data
2022-06-17 14:49:17 - LocalDataCollection - INFO - All input files for this object have now been used.
2022-06-17 14:49:17 - LocalDataCollection - INFO - resetting used bricks
2022-06-17 14:49:17 - CommonDataModel - INFO - for observation: found 4 objects
2022-06-17 14:49:17 - CommonDataModel - INFO - working on observation
2022-06-17 14:49:17 - CommonDataModel - INFO - starting on Antibody 3027
2022-06-17 14:49:17 - Observation - INFO - Called apply_rules
2022-06-17 14:49:17 - LocalDataCollection - INFO - Retrieving initial dataframe for 'Serology.csv' for the first time
2022-06-17 14:49:17 - Observation - INFO - Mapped observation_concept_id
2022-06-17 14:49:17 - Observation - INFO - Mapped observation_datetime
2022-06-17 14:49:17 - Observation - INFO - Mapped observation_source_concept_id
2022-06-17 14:49:17 - Observation - INFO - Mapped observation_source_value
2022-06-17 14:49:17 - Observation - INFO - Mapped person_id
2022-06-17 14:49:17 - Observation - INFO - Automatically formatting data columns.
2022-06-17 14:49:17 - Observation - INFO - created df (0x10aba8730)[Antibody_3027]
2022-06-17 14:49:17 - CommonDataModel - INFO - finished Antibody 3027 (0x10aba8730) ... 1/4 completed, 204 rows
2022-06-17 14:49:17 - CommonDataModel - INFO - starting on H/O: heart failure 3043
2022-06-17 14:49:17 - Observation - INFO - Called apply_rules
2022-06-17 14:49:17 - LocalDataCollection - INFO - Retrieving initial dataframe for 'Hospital_Visit.csv' for the first time
2022-06-17 14:49:17 - Observation - INFO - Mapped observation_concept_id
2022-06-17 14:49:17 - Observation - INFO - Mapped observation_datetime
2022-06-17 14:49:17 - Observation - INFO - Mapped observation_source_concept_id
2022-06-17 14:49:17 - Observation - INFO - Mapped observation_source_value
2022-06-17 14:49:17 - Observation - INFO - Mapped person_id
2022-06-17 14:49:18 - Observation - WARNING - Requiring non-null values in observation_concept_id removed 487 rows, leaving 113 rows.
2022-06-17 14:49:18 - Observation - INFO - Automatically formatting data columns.
2022-06-17 14:49:18 - Observation - INFO - created df (0x10aba8c70)[H_O_heart_failure_3043]
2022-06-17 14:49:18 - CommonDataModel - INFO - finished H/O: heart failure 3043 (0x10aba8c70) ... 2/4 completed, 113 rows
2022-06-17 14:49:18 - CommonDataModel - INFO - starting on 2019-nCoV 3044
2022-06-17 14:49:18 - Observation - INFO - Called apply_rules
2022-06-17 14:49:18 - Observation - INFO - Mapped observation_concept_id
2022-06-17 14:49:18 - Observation - INFO - Mapped observation_datetime
2022-06-17 14:49:18 - Observation - INFO - Mapped observation_source_concept_id
2022-06-17 14:49:18 - Observation - INFO - Mapped observation_source_value
2022-06-17 14:49:18 - Observation - INFO - Mapped person_id
2022-06-17 14:49:18 - Observation - WARNING - Requiring non-null values in observation_concept_id removed 506 rows, leaving 94 rows.

2022-06-17 14:49:18 - Observation - INFO - Automatically formatting data columns.
2022-06-17 14:49:18 - Observation - INFO - created df (0x10abb9cd0)[2019_nCoV_3044]
2022-06-17 14:49:18 - CommonDataModel - INFO - finished 2019-nCoV 3044 (0x10abb9cd0) ... 3/4 completed, 94 rows
2022-06-17 14:49:18 - CommonDataModel - INFO - starting on Cancer 3045
2022-06-17 14:49:18 - Observation - INFO - Called apply_rules
2022-06-17 14:49:18 - Observation - INFO - Mapped observation_concept_id
2022-06-17 14:49:18 - Observation - INFO - Mapped observation_datetime
2022-06-17 14:49:18 - Observation - INFO - Mapped observation_source_concept_id
2022-06-17 14:49:18 - Observation - INFO - Mapped observation_source_value
2022-06-17 14:49:18 - Observation - INFO - Mapped person_id
2022-06-17 14:49:18 - Observation - WARNING - Requiring non-null values in observation_concept_id removed 426 rows, leaving 174 rows.
2022-06-17 14:49:18 - Observation - INFO - Automatically formatting data columns.
2022-06-17 14:49:18 - Observation - INFO - created df (0x10abc4f10)[Cancer_3045]
2022-06-17 14:49:18 - CommonDataModel - INFO - finished Cancer 3045 (0x10abc4f10) ... 4/4 completed, 174 rows
2022-06-17 14:49:18 - CommonDataModel - INFO - saving dataframe (0x10abb9160) to <carrot.io.plugins.local.LocalDataCollection object at 0x10a847a00>
2022-06-17 14:49:18 - LocalDataCollection - INFO - saving observation to ./test_outputs/local//observation.csv
2022-06-17 14:49:18 - LocalDataCollection - INFO - finished save to file
2022-06-17 14:49:18 - CommonDataModel - INFO - finalised observation on iteration 0 producing 585 rows from 4 tables
2022-06-17 14:49:18 - LocalDataCollection - INFO - Getting next chunk of data
2022-06-17 14:49:18 - LocalDataCollection - INFO - All input files for this object have now been used.
2022-06-17 14:49:18 - LocalDataCollection - INFO - resetting used bricks
2022-06-17 14:49:18 - CommonDataModel - INFO - for condition_occurrence: found 12 objects
2022-06-17 14:49:18 - CommonDataModel - INFO - working on condition_occurrence
2022-06-17 14:49:18 - CommonDataModel - INFO - starting on Headache 3028
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:18 - LocalDataCollection - INFO - Retrieving initial dataframe for 'Symptoms.csv' for the first time
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:18 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 39 rows, leaving 126 rows.
2022-06-17 14:49:18 - ConditionOccurrence - WARNING - Requiring non-null values in condition_start_datetime removed 1 rows, leaving 125 rows.
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:18 - ConditionOccurrence - INFO - created df (0x10abc49a0)[Headache_3028]
2022-06-17 14:49:18 - CommonDataModel - INFO - finished Headache 3028 (0x10abc49a0) ... 1/12 completed, 125 rows
2022-06-17 14:49:18 - CommonDataModel - ERROR - There are person_ids in this table that are not in the output person table!
2022-06-17 14:49:18 - CommonDataModel - ERROR - Either they are not in the original data, or while creating the person table, 
2022-06-17 14:49:18 - CommonDataModel - ERROR - studies have been removed due to lack of required fields, such as birthdate.
2022-06-17 14:49:18 - CommonDataModel - ERROR - 123/125 were good, 2 studies are removed.
2022-06-17 14:49:18 - CommonDataModel - INFO - starting on Fatigue 3029
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:18 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 52 rows, leaving 113 rows.
2022-06-17 14:49:18 - ConditionOccurrence - WARNING - Requiring non-null values in condition_start_datetime removed 1 rows, leaving 112 rows.
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:18 - ConditionOccurrence - INFO - created df (0x10abc4280)[Fatigue_3029]
2022-06-17 14:49:18 - CommonDataModel - INFO - finished Fatigue 3029 (0x10abc4280) ... 2/12 completed, 112 rows
2022-06-17 14:49:18 - CommonDataModel - ERROR - There are person_ids in this table that are not in the output person table!
2022-06-17 14:49:18 - CommonDataModel - ERROR - Either they are not in the original data, or while creating the person table, 
2022-06-17 14:49:18 - CommonDataModel - ERROR - studies have been removed due to lack of required fields, such as birthdate.
2022-06-17 14:49:18 - CommonDataModel - ERROR - 110/112 were good, 2 studies are removed.
2022-06-17 14:49:18 - CommonDataModel - INFO - starting on Dizziness 3030
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:18 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:18 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 88 rows, leaving 77 rows.

2022-06-17 14:49:19 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:19 - ConditionOccurrence - INFO - created df (0x10ac069d0)[Dizziness_3030]
2022-06-17 14:49:19 - CommonDataModel - INFO - finished Dizziness 3030 (0x10ac069d0) ... 3/12 completed, 77 rows
2022-06-17 14:49:19 - CommonDataModel - INFO - starting on Cough 3031
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:19 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 60 rows, leaving 105 rows.
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:19 - ConditionOccurrence - INFO - created df (0x10ac0bf10)[Cough_3031]
2022-06-17 14:49:19 - CommonDataModel - INFO - finished Cough 3031 (0x10ac0bf10) ... 4/12 completed, 105 rows
2022-06-17 14:49:19 - CommonDataModel - ERROR - There are person_ids in this table that are not in the output person table!
2022-06-17 14:49:19 - CommonDataModel - ERROR - Either they are not in the original data, or while creating the person table, 
2022-06-17 14:49:19 - CommonDataModel - ERROR - studies have been removed due to lack of required fields, such as birthdate.
2022-06-17 14:49:19 - CommonDataModel - ERROR - 103/105 were good, 2 studies are removed.
2022-06-17 14:49:19 - CommonDataModel - INFO - starting on Fever 3032
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:19 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 128 rows, leaving 37 rows.
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:19 - ConditionOccurrence - INFO - created df (0x10ac05970)[Fever_3032]
2022-06-17 14:49:19 - CommonDataModel - INFO - finished Fever 3032 (0x10ac05970) ... 5/12 completed, 37 rows
2022-06-17 14:49:19 - CommonDataModel - INFO - starting on Muscle pain 3033
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:19 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 147 rows, leaving 18 rows.
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:19 - ConditionOccurrence - INFO - created df (0x10ac2eb20)[Muscle_pain_3033]
2022-06-17 14:49:19 - CommonDataModel - INFO - finished Muscle pain 3033 (0x10ac2eb20) ... 6/12 completed, 18 rows
2022-06-17 14:49:19 - CommonDataModel - INFO - starting on Pneumonia 3042
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:19 - LocalDataCollection - INFO - Retrieving initial dataframe for 'Hospital_Visit.csv' for the first time
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:19 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 509 rows, leaving 91 rows.
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:19 - ConditionOccurrence - INFO - created df (0x10ac24190)[Pneumonia_3042]
2022-06-17 14:49:19 - CommonDataModel - INFO - finished Pneumonia 3042 (0x10ac24190) ... 7/12 completed, 91 rows
2022-06-17 14:49:19 - CommonDataModel - ERROR - There are person_ids in this table that are not in the output person table!
2022-06-17 14:49:19 - CommonDataModel - ERROR - Either they are not in the original data, or while creating the person table, 
2022-06-17 14:49:19 - CommonDataModel - ERROR - studies have been removed due to lack of required fields, such as birthdate.
2022-06-17 14:49:19 - CommonDataModel - ERROR - 90/91 were good, 1 studies are removed.
2022-06-17 14:49:19 - CommonDataModel - INFO - starting on Mental health problem 3046
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:19 - LocalDataCollection - INFO - Retrieving initial dataframe for 'GP_Records.csv' for the first time
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_source_value

2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:19 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 742 rows, leaving 225 rows.
2022-06-17 14:49:19 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:19 - ConditionOccurrence - INFO - created df (0x10abc4820)[Mental_health_problem_3046]
2022-06-17 14:49:19 - CommonDataModel - INFO - finished Mental health problem 3046 (0x10abc4820) ... 8/12 completed, 225 rows
2022-06-17 14:49:20 - CommonDataModel - ERROR - There are person_ids in this table that are not in the output person table!
2022-06-17 14:49:20 - CommonDataModel - ERROR - Either they are not in the original data, or while creating the person table, 
2022-06-17 14:49:20 - CommonDataModel - ERROR - studies have been removed due to lack of required fields, such as birthdate.
2022-06-17 14:49:20 - CommonDataModel - ERROR - 224/225 were good, 1 studies are removed.
2022-06-17 14:49:20 - CommonDataModel - INFO - starting on Mental disorder 3047
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:20 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 742 rows, leaving 225 rows.
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:20 - ConditionOccurrence - INFO - created df (0x10ac3fcd0)[Mental_disorder_3047]
2022-06-17 14:49:20 - CommonDataModel - INFO - finished Mental disorder 3047 (0x10ac3fcd0) ... 9/12 completed, 225 rows
2022-06-17 14:49:20 - CommonDataModel - ERROR - There are person_ids in this table that are not in the output person table!
2022-06-17 14:49:20 - CommonDataModel - ERROR - Either they are not in the original data, or while creating the person table, 
2022-06-17 14:49:20 - CommonDataModel - ERROR - studies have been removed due to lack of required fields, such as birthdate.
2022-06-17 14:49:20 - CommonDataModel - ERROR - 224/225 were good, 1 studies are removed.
2022-06-17 14:49:20 - CommonDataModel - INFO - starting on Type 2 diabetes mellitus 3048
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:20 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 851 rows, leaving 116 rows.
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:20 - ConditionOccurrence - INFO - created df (0x10ac5df40)[Type_2_diabetes_mellitus_3048]
2022-06-17 14:49:20 - CommonDataModel - INFO - finished Type 2 diabetes mellitus 3048 (0x10ac5df40) ... 10/12 completed, 116 rows
2022-06-17 14:49:20 - CommonDataModel - INFO - starting on Ischemic heart disease 3049
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:20 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 872 rows, leaving 95 rows.
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:20 - ConditionOccurrence - INFO - created df (0x10ac71e80)[Ischemic_heart_disease_3049]
2022-06-17 14:49:20 - CommonDataModel - INFO - finished Ischemic heart disease 3049 (0x10ac71e80) ... 11/12 completed, 95 rows
2022-06-17 14:49:20 - CommonDataModel - INFO - starting on Hypertensive disorder 3050
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:20 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 886 rows, leaving 81 rows.
2022-06-17 14:49:20 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:20 - ConditionOccurrence - INFO - created df (0x10ac88a60)[Hypertensive_disorder_3050]
2022-06-17 14:49:20 - CommonDataModel - INFO - finished Hypertensive disorder 3050 (0x10ac88a60) ... 12/12 completed, 81 rows
2022-06-17 14:49:20 - CommonDataModel - INFO - saving dataframe (0x10abb9820) to <carrot.io.plugins.local.LocalDataCollection object at 0x10a847a00>
2022-06-17 14:49:20 - LocalDataCollection - INFO - saving condition_occurrence to ./test_outputs/local//condition_occurrence.csv
2022-06-17 14:49:20 - LocalDataCollection - INFO - finished save to file

2022-06-17 14:49:21 - CommonDataModel - INFO - finalised condition_occurrence on iteration 0 producing 1298 rows from 12 tables
2022-06-17 14:49:21 - LocalDataCollection - INFO - Getting next chunk of data
2022-06-17 14:49:21 - LocalDataCollection - INFO - All input files for this object have now been used.
2022-06-17 14:49:21 - LocalDataCollection - INFO - resetting used bricks
2022-06-17 14:49:21 - CommonDataModel - INFO - for drug_exposure: found 5 objects
2022-06-17 14:49:21 - CommonDataModel - INFO - working on drug_exposure
2022-06-17 14:49:21 - CommonDataModel - INFO - starting on COVID-19 vaccine 3034
2022-06-17 14:49:21 - DrugExposure - INFO - Called apply_rules
2022-06-17 14:49:21 - LocalDataCollection - INFO - Retrieving initial dataframe for 'Vaccinations.csv' for the first time
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_concept_id
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_exposure_end_datetime
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_exposure_start_datetime
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_source_concept_id
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_source_value
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped person_id
2022-06-17 14:49:21 - DrugExposure - WARNING - Requiring non-null values in drug_concept_id removed 231 rows, leaving 129 rows.
2022-06-17 14:49:21 - DrugExposure - INFO - Automatically formatting data columns.
2022-06-17 14:49:21 - DrugExposure - INFO - created df (0x10aca01f0)[COVID_19_vaccine_3034]
2022-06-17 14:49:21 - CommonDataModel - INFO - finished COVID-19 vaccine 3034 (0x10aca01f0) ... 1/5 completed, 129 rows
2022-06-17 14:49:21 - CommonDataModel - INFO - starting on COVID-19 vaccine 3035
2022-06-17 14:49:21 - DrugExposure - INFO - Called apply_rules
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_concept_id
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_exposure_end_datetime
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_exposure_start_datetime
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_source_concept_id
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_source_value
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped person_id
2022-06-17 14:49:21 - DrugExposure - WARNING - Requiring non-null values in drug_concept_id removed 246 rows, leaving 114 rows.
2022-06-17 14:49:21 - DrugExposure - INFO - Automatically formatting data columns.
2022-06-17 14:49:21 - DrugExposure - INFO - created df (0x10aca0970)[COVID_19_vaccine_3035]
2022-06-17 14:49:21 - CommonDataModel - INFO - finished COVID-19 vaccine 3035 (0x10aca0970) ... 2/5 completed, 114 rows
2022-06-17 14:49:21 - CommonDataModel - ERROR - There are person_ids in this table that are not in the output person table!
2022-06-17 14:49:21 - CommonDataModel - ERROR - Either they are not in the original data, or while creating the person table, 
2022-06-17 14:49:21 - CommonDataModel - ERROR - studies have been removed due to lack of required fields, such as birthdate.
2022-06-17 14:49:21 - CommonDataModel - ERROR - 112/114 were good, 2 studies are removed.
2022-06-17 14:49:21 - CommonDataModel - INFO - starting on COVID-19 vaccine 3036
2022-06-17 14:49:21 - DrugExposure - INFO - Called apply_rules
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_concept_id
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_exposure_end_datetime
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_exposure_start_datetime
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_source_concept_id
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_source_value
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped person_id
2022-06-17 14:49:21 - DrugExposure - WARNING - Requiring non-null values in drug_concept_id removed 243 rows, leaving 117 rows.
2022-06-17 14:49:21 - DrugExposure - INFO - Automatically formatting data columns.
2022-06-17 14:49:21 - DrugExposure - INFO - created df (0x10acccd30)[COVID_19_vaccine_3036]
2022-06-17 14:49:21 - CommonDataModel - INFO - finished COVID-19 vaccine 3036 (0x10acccd30) ... 3/5 completed, 117 rows
2022-06-17 14:49:21 - CommonDataModel - INFO - starting on SARS-CoV-2 (COVID-19) vaccine, mRNA-1273 0.2 MG/ML Injectable Suspension 3040
2022-06-17 14:49:21 - DrugExposure - INFO - Called apply_rules
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_concept_id
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_exposure_end_datetime
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_exposure_start_datetime
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_source_concept_id
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_source_value
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped person_id
2022-06-17 14:49:21 - DrugExposure - WARNING - Requiring non-null values in drug_concept_id removed 231 rows, leaving 129 rows.
2022-06-17 14:49:21 - DrugExposure - INFO - Automatically formatting data columns.
2022-06-17 14:49:21 - DrugExposure - INFO - created df (0x10acd6730)[SARS_CoV_2_COVID_19_vaccine_mRNA_1273_0_2_MG_ML_Injectable_Suspension_3040]
2022-06-17 14:49:21 - CommonDataModel - INFO - finished SARS-CoV-2 (COVID-19) vaccine, mRNA-1273 0.2 MG/ML Injectable Suspension 3040 (0x10acd6730) ... 4/5 completed, 129 rows
2022-06-17 14:49:21 - CommonDataModel - INFO - starting on SARS-CoV-2 (COVID-19) vaccine, mRNA-BNT162b2 0.1 MG/ML Injectable Suspension 3041
2022-06-17 14:49:21 - DrugExposure - INFO - Called apply_rules
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_concept_id
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_exposure_end_datetime
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_exposure_start_datetime
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_source_concept_id
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped drug_source_value
2022-06-17 14:49:21 - DrugExposure - INFO - Mapped person_id
2022-06-17 14:49:22 - DrugExposure - WARNING - Requiring non-null values in drug_concept_id removed 243 rows, leaving 117 rows.
2022-06-17 14:49:22 - DrugExposure - INFO - Automatically formatting data columns.

2022-06-17 14:49:22 - DrugExposure - INFO - created df (0x10aca0610)[SARS_CoV_2_COVID_19_vaccine_mRNA_BNT162b2_0_1_MG_ML_Injectable_Suspension_3041]
2022-06-17 14:49:22 - CommonDataModel - INFO - finished SARS-CoV-2 (COVID-19) vaccine, mRNA-BNT162b2 0.1 MG/ML Injectable Suspension 3041 (0x10aca0610) ... 5/5 completed, 117 rows
2022-06-17 14:49:22 - CommonDataModel - INFO - saving dataframe (0x10ac2e6d0) to <carrot.io.plugins.local.LocalDataCollection object at 0x10a847a00>
2022-06-17 14:49:22 - LocalDataCollection - INFO - saving drug_exposure to ./test_outputs/local//drug_exposure.csv
2022-06-17 14:49:22 - LocalDataCollection - INFO - finished save to file
2022-06-17 14:49:22 - CommonDataModel - INFO - finalised drug_exposure on iteration 0 producing 604 rows from 5 tables
2022-06-17 14:49:22 - LocalDataCollection - INFO - Getting next chunk of data
2022-06-17 14:49:22 - LocalDataCollection - INFO - All input files for this object have now been used.

cdm['person'].dropna(axis=1)
gender_concept_id year_of_birth month_of_birth day_of_birth birth_datetime gender_source_value gender_source_concept_id
person_id
1 8507 1975 7 13 1975-07-13 00:00:00.000000 Male 8507
2 8507 1968 7 14 1968-07-14 00:00:00.000000 Male 8507
3 8507 1976 7 12 1976-07-12 00:00:00.000000 Male 8507
4 8507 1942 7 21 1942-07-21 00:00:00.000000 Male 8507
5 8507 1943 7 21 1943-07-21 00:00:00.000000 Male 8507
... ... ... ... ... ... ... ...
494 8532 1966 7 15 1966-07-15 00:00:00.000000 Female 8532
495 8532 1979 7 12 1979-07-12 00:00:00.000000 Female 8532
496 8532 1994 7 8 1994-07-08 00:00:00.000000 Female 8532
497 8532 1950 7 19 1950-07-19 00:00:00.000000 Female 8532
498 8532 1945 7 20 1945-07-20 00:00:00.000000 Female 8532

498 rows × 7 columns

cdm['observation'].dropna(axis=1)
person_id observation_concept_id observation_date observation_datetime observation_source_value observation_source_concept_id
observation_id
1 458 4288455 2020-11-20 2020-11-20 00:00:00.000000 61.597638175407624 4288455
2 330 4288455 2022-09-12 2022-09-12 00:00:00.000000 6.413970727863393 4288455
3 136 4288455 2019-02-02 2019-02-02 00:00:00.000000 4.691778998076871 4288455
4 17 4288455 2020-08-18 2020-08-18 00:00:00.000000 14.77684965289811 4288455
5 55 4288455 2019-11-23 2019-11-23 00:00:00.000000 79.61828150487227 4288455
... ... ... ... ... ... ...
581 279 40757663 2019-11-10 2019-11-10 00:00:00.000000 Cancer 40757663
582 494 40757663 2020-08-28 2020-08-28 00:00:00.000000 Cancer 40757663
583 282 40757663 2020-02-03 2020-02-03 00:00:00.000000 Cancer 40757663
584 284 40757663 2020-11-21 2020-11-21 00:00:00.000000 Cancer 40757663
585 284 40757663 2019-09-06 2019-09-06 00:00:00.000000 Cancer 40757663

585 rows × 6 columns

cdm.logs
{'meta': {'version': '0.0.0',
  'created_by': 'calummacdonald',
  'created_at': '2022-06-17T134916',
  'dataset': 'CommonDataModel',
  'total_data_processed': {'person': 498,
   'observation': 585,
   'condition_occurrence': 1298,
   'drug_exposure': 604}},
 'person': {'0x10a8f3c70': {'required_fields': {'gender_concept_id': {'before': 500,
     'after': 284,
     'after_formatting': 284},
    'birth_datetime': {'before': 284, 'after': 284, 'after_formatting': 284}},
   'source_files': {'birth_datetime': {'table': 'Demographics.csv',
     'field': 'Age'},
    'gender_concept_id': {'table': 'Demographics.csv', 'field': 'Sex'},
    'gender_source_concept_id': {'table': 'Demographics.csv', 'field': 'Sex'},
    'gender_source_value': {'table': 'Demographics.csv', 'field': 'Sex'},
    'person_id': {'table': 'Demographics.csv', 'field': 'ID'}},
   'original_file': '../data/part2/Demographics.csv',
   'name': 'MALE_3025'},
  '0x10a919bb0': {'required_fields': {'gender_concept_id': {'before': 500,
     'after': 214,
     'after_formatting': 214},
    'birth_datetime': {'before': 214, 'after': 214, 'after_formatting': 214}},
   'source_files': {'birth_datetime': {'table': 'Demographics.csv',
     'field': 'Age'},
    'gender_concept_id': {'table': 'Demographics.csv', 'field': 'Sex'},
    'gender_source_concept_id': {'table': 'Demographics.csv', 'field': 'Sex'},
    'gender_source_value': {'table': 'Demographics.csv', 'field': 'Sex'},
    'person_id': {'table': 'Demographics.csv', 'field': 'ID'}},
   'original_file': '../data/part2/Demographics.csv',
   'name': 'FEMALE_3026'}},
 'observation': {'0x10aba8730': {'required_fields': {'person_id': {'before': 204,
     'after': 204},
    'observation_concept_id': {'before': 204,
     'after': 204,
     'after_formatting': 204},
    'observation_datetime': {'before': 204,
     'after': 204,
     'after_formatting': 204}},
   'source_files': {'observation_concept_id': {'table': 'Serology.csv',
     'field': 'IgG'},
    'observation_datetime': {'table': 'Serology.csv', 'field': 'Date'},
    'observation_source_concept_id': {'table': 'Serology.csv', 'field': 'IgG'},
    'observation_source_value': {'table': 'Serology.csv', 'field': 'IgG'},
    'person_id': {'table': 'Serology.csv', 'field': 'ID'}},
   'original_file': '../data/part2/Serology.csv',
   'name': 'Antibody_3027',
   'valid_person_id': {'before': 204, 'after': 204}},
  '0x10aba8c70': {'required_fields': {'person_id': {'before': 600,
     'after': 600},
    'observation_concept_id': {'before': 600,
     'after': 113,
     'after_formatting': 113},
    'observation_datetime': {'before': 113,
     'after': 113,
     'after_formatting': 113}},
   'source_files': {'observation_concept_id': {'table': 'Hospital_Visit.csv',
     'field': 'reason'},
    'observation_datetime': {'table': 'Hospital_Visit.csv',
     'field': 'admission_date'},
    'observation_source_concept_id': {'table': 'Hospital_Visit.csv',
     'field': 'reason'},
    'observation_source_value': {'table': 'Hospital_Visit.csv',
     'field': 'reason'},
    'person_id': {'table': 'Hospital_Visit.csv', 'field': 'ID'}},
   'original_file': '../data/part2/Hospital_Visit.csv',
   'name': 'H_O_heart_failure_3043',
   'valid_person_id': {'before': 113, 'after': 113}},
  '0x10abb9cd0': {'required_fields': {'person_id': {'before': 600,
     'after': 600},
    'observation_concept_id': {'before': 600,
     'after': 94,
     'after_formatting': 94},
    'observation_datetime': {'before': 94,
     'after': 94,
     'after_formatting': 94}},
   'source_files': {'observation_concept_id': {'table': 'Hospital_Visit.csv',
     'field': 'reason'},
    'observation_datetime': {'table': 'Hospital_Visit.csv',
     'field': 'admission_date'},
    'observation_source_concept_id': {'table': 'Hospital_Visit.csv',
     'field': 'reason'},
    'observation_source_value': {'table': 'Hospital_Visit.csv',
     'field': 'reason'},
    'person_id': {'table': 'Hospital_Visit.csv', 'field': 'ID'}},
   'original_file': '../data/part2/Hospital_Visit.csv',
   'name': '2019_nCoV_3044',
   'valid_person_id': {'before': 94, 'after': 94}},
  '0x10abc4f10': {'required_fields': {'person_id': {'before': 600,
     'after': 600},
    'observation_concept_id': {'before': 600,
     'after': 174,
     'after_formatting': 174},
    'observation_datetime': {'before': 174,
     'after': 174,
     'after_formatting': 174}},
   'source_files': {'observation_concept_id': {'table': 'Hospital_Visit.csv',
     'field': 'reason'},
    'observation_datetime': {'table': 'Hospital_Visit.csv',
     'field': 'admission_date'},
    'observation_source_concept_id': {'table': 'Hospital_Visit.csv',
     'field': 'reason'},
    'observation_source_value': {'table': 'Hospital_Visit.csv',
     'field': 'reason'},
    'person_id': {'table': 'Hospital_Visit.csv', 'field': 'ID'}},
   'original_file': '../data/part2/Hospital_Visit.csv',
   'name': 'Cancer_3045',
   'valid_person_id': {'before': 174, 'after': 174}}},
 'condition_occurrence': {'0x10abc49a0': {'required_fields': {'person_id': {'before': 165,
     'after': 165},
    'condition_concept_id': {'before': 165,
     'after': 126,
     'after_formatting': 125},
    'condition_start_datetime': {'before': 126,
     'after': 125,
     'after_formatting': 125}},
   'source_files': {'condition_concept_id': {'table': 'Symptoms.csv',
     'field': 'Headache'},
    'condition_end_datetime': {'table': 'Symptoms.csv',
     'field': 'date_occurrence'},
    'condition_source_concept_id': {'table': 'Symptoms.csv',
     'field': 'Headache'},
    'condition_source_value': {'table': 'Symptoms.csv', 'field': 'Headache'},
    'condition_start_datetime': {'table': 'Symptoms.csv',
     'field': 'date_occurrence'},
    'person_id': {'table': 'Symptoms.csv', 'field': 'ID'}},
   'original_file': '../data/part2/Symptoms.csv',
   'name': 'Headache_3028',
   'valid_person_id': {'before': 125, 'after': 123}},
  '0x10abc4280': {'required_fields': {'person_id': {'before': 165,
     'after': 165},
    'condition_concept_id': {'before': 165,
     'after': 113,
     'after_formatting': 112},
    'condition_start_datetime': {'before': 113,
     'after': 112,
     'after_formatting': 112}},
   'source_files': {'condition_concept_id': {'table': 'Symptoms.csv',
     'field': 'Fatigue'},
    'condition_end_datetime': {'table': 'Symptoms.csv',
     'field': 'date_occurrence'},
    'condition_source_concept_id': {'table': 'Symptoms.csv',
     'field': 'Fatigue'},
    'condition_source_value': {'table': 'Symptoms.csv', 'field': 'Fatigue'},
    'condition_start_datetime': {'table': 'Symptoms.csv',
     'field': 'date_occurrence'},
    'person_id': {'table': 'Symptoms.csv', 'field': 'ID'}},
   'original_file': '../data/part2/Symptoms.csv',
   'name': 'Fatigue_3029',
   'valid_person_id': {'before': 112, 'after': 110}},
  '0x10ac069d0': {'required_fields': {'person_id': {'before': 165,
     'after': 165},
    'condition_concept_id': {'before': 165,
     'after': 77,
     'after_formatting': 77},
    'condition_start_datetime': {'before': 77,
     'after': 77,
     'after_formatting': 77}},
   'source_files': {'condition_concept_id': {'table': 'Symptoms.csv',
     'field': 'Dizzy'},
    'condition_end_datetime': {'table': 'Symptoms.csv',
     'field': 'date_occurrence'},
    'condition_source_concept_id': {'table': 'Symptoms.csv', 'field': 'Dizzy'},
    'condition_source_value': {'table': 'Symptoms.csv', 'field': 'Dizzy'},
    'condition_start_datetime': {'table': 'Symptoms.csv',
     'field': 'date_occurrence'},
    'person_id': {'table': 'Symptoms.csv', 'field': 'ID'}},
   'original_file': '../data/part2/Symptoms.csv',
   'name': 'Dizziness_3030',
   'valid_person_id': {'before': 77, 'after': 77}},
  '0x10ac0bf10': {'required_fields': {'person_id': {'before': 165,
     'after': 165},
    'condition_concept_id': {'before': 165,
     'after': 105,
     'after_formatting': 105},
    'condition_start_datetime': {'before': 105,
     'after': 105,
     'after_formatting': 105}},
   'source_files': {'condition_concept_id': {'table': 'Symptoms.csv',
     'field': 'Cough'},
    'condition_end_datetime': {'table': 'Symptoms.csv',
     'field': 'date_occurrence'},
    'condition_source_concept_id': {'table': 'Symptoms.csv', 'field': 'Cough'},
    'condition_source_value': {'table': 'Symptoms.csv', 'field': 'Cough'},
    'condition_start_datetime': {'table': 'Symptoms.csv',
     'field': 'date_occurrence'},
    'person_id': {'table': 'Symptoms.csv', 'field': 'ID'}},
   'original_file': '../data/part2/Symptoms.csv',
   'name': 'Cough_3031',
   'valid_person_id': {'before': 105, 'after': 103}},
  '0x10ac05970': {'required_fields': {'person_id': {'before': 165,
     'after': 165},
    'condition_concept_id': {'before': 165,
     'after': 37,
     'after_formatting': 37},
    'condition_start_datetime': {'before': 37,
     'after': 37,
     'after_formatting': 37}},
   'source_files': {'condition_concept_id': {'table': 'Symptoms.csv',
     'field': 'Fever'},
    'condition_end_datetime': {'table': 'Symptoms.csv',
     'field': 'date_occurrence'},
    'condition_source_concept_id': {'table': 'Symptoms.csv', 'field': 'Fever'},
    'condition_source_value': {'table': 'Symptoms.csv', 'field': 'Fever'},
    'condition_start_datetime': {'table': 'Symptoms.csv',
     'field': 'date_occurrence'},
    'person_id': {'table': 'Symptoms.csv', 'field': 'ID'}},
   'original_file': '../data/part2/Symptoms.csv',
   'name': 'Fever_3032',
   'valid_person_id': {'before': 37, 'after': 37}},
  '0x10ac2eb20': {'required_fields': {'person_id': {'before': 165,
     'after': 165},
    'condition_concept_id': {'before': 165,
     'after': 18,
     'after_formatting': 18},
    'condition_start_datetime': {'before': 18,
     'after': 18,
     'after_formatting': 18}},
   'source_files': {'condition_concept_id': {'table': 'Symptoms.csv',
     'field': 'Muscle_Pain'},
    'condition_end_datetime': {'table': 'Symptoms.csv',
     'field': 'date_occurrence'},
    'condition_source_concept_id': {'table': 'Symptoms.csv',
     'field': 'Muscle_Pain'},
    'condition_source_value': {'table': 'Symptoms.csv',
     'field': 'Muscle_Pain'},
    'condition_start_datetime': {'table': 'Symptoms.csv',
     'field': 'date_occurrence'},
    'person_id': {'table': 'Symptoms.csv', 'field': 'ID'}},
   'original_file': '../data/part2/Symptoms.csv',
   'name': 'Muscle_pain_3033',
   'valid_person_id': {'before': 18, 'after': 18}},
  '0x10ac24190': {'required_fields': {'person_id': {'before': 600,
     'after': 600},
    'condition_concept_id': {'before': 600,
     'after': 91,
     'after_formatting': 91},
    'condition_start_datetime': {'before': 91,
     'after': 91,
     'after_formatting': 91}},
   'source_files': {'condition_concept_id': {'table': 'Hospital_Visit.csv',
     'field': 'reason'},
    'condition_end_datetime': {'table': 'Hospital_Visit.csv',
     'field': 'admission_date'},
    'condition_source_concept_id': {'table': 'Hospital_Visit.csv',
     'field': 'reason'},
    'condition_source_value': {'table': 'Hospital_Visit.csv',
     'field': 'reason'},
    'condition_start_datetime': {'table': 'Hospital_Visit.csv',
     'field': 'admission_date'},
    'person_id': {'table': 'Hospital_Visit.csv', 'field': 'ID'}},
   'original_file': '../data/part2/Hospital_Visit.csv',
   'name': 'Pneumonia_3042',
   'valid_person_id': {'before': 91, 'after': 90}},
  '0x10abc4820': {'required_fields': {'person_id': {'before': 967,
     'after': 967},
    'condition_concept_id': {'before': 967,
     'after': 225,
     'after_formatting': 225},
    'condition_start_datetime': {'before': 225,
     'after': 225,
     'after_formatting': 225}},
   'source_files': {'condition_concept_id': {'table': 'GP_Records.csv',
     'field': 'comorbidity'},
    'condition_end_datetime': {'table': 'GP_Records.csv',
     'field': 'date_of_visit'},
    'condition_source_concept_id': {'table': 'GP_Records.csv',
     'field': 'comorbidity'},
    'condition_source_value': {'table': 'GP_Records.csv',
     'field': 'comorbidity'},
    'condition_start_datetime': {'table': 'GP_Records.csv',
     'field': 'date_of_visit'},
    'person_id': {'table': 'GP_Records.csv', 'field': 'ID'}},
   'original_file': '../data/part2/GP_Records.csv',
   'name': 'Mental_health_problem_3046',
   'valid_person_id': {'before': 225, 'after': 224}},
  '0x10ac3fcd0': {'required_fields': {'person_id': {'before': 967,
     'after': 967},
    'condition_concept_id': {'before': 967,
     'after': 225,
     'after_formatting': 225},
    'condition_start_datetime': {'before': 225,
     'after': 225,
     'after_formatting': 225}},
   'source_files': {'condition_concept_id': {'table': 'GP_Records.csv',
     'field': 'comorbidity'},
    'condition_end_datetime': {'table': 'GP_Records.csv',
     'field': 'date_of_visit'},
    'condition_source_concept_id': {'table': 'GP_Records.csv',
     'field': 'comorbidity'},
    'condition_source_value': {'table': 'GP_Records.csv',
     'field': 'comorbidity'},
    'condition_start_datetime': {'table': 'GP_Records.csv',
     'field': 'date_of_visit'},
    'person_id': {'table': 'GP_Records.csv', 'field': 'ID'}},
   'original_file': '../data/part2/GP_Records.csv',
   'name': 'Mental_disorder_3047',
   'valid_person_id': {'before': 225, 'after': 224}},
  '0x10ac5df40': {'required_fields': {'person_id': {'before': 967,
     'after': 967},
    'condition_concept_id': {'before': 967,
     'after': 116,
     'after_formatting': 116},
    'condition_start_datetime': {'before': 116,
     'after': 116,
     'after_formatting': 116}},
   'source_files': {'condition_concept_id': {'table': 'GP_Records.csv',
     'field': 'comorbidity'},
    'condition_end_datetime': {'table': 'GP_Records.csv',
     'field': 'date_of_visit'},
    'condition_source_concept_id': {'table': 'GP_Records.csv',
     'field': 'comorbidity'},
    'condition_source_value': {'table': 'GP_Records.csv',
     'field': 'comorbidity'},
    'condition_start_datetime': {'table': 'GP_Records.csv',
     'field': 'date_of_visit'},
    'person_id': {'table': 'GP_Records.csv', 'field': 'ID'}},
   'original_file': '../data/part2/GP_Records.csv',
   'name': 'Type_2_diabetes_mellitus_3048',
   'valid_person_id': {'before': 116, 'after': 116}},
  '0x10ac71e80': {'required_fields': {'person_id': {'before': 967,
     'after': 967},
    'condition_concept_id': {'before': 967,
     'after': 95,
     'after_formatting': 95},
    'condition_start_datetime': {'before': 95,
     'after': 95,
     'after_formatting': 95}},
   'source_files': {'condition_concept_id': {'table': 'GP_Records.csv',
     'field': 'comorbidity'},
    'condition_end_datetime': {'table': 'GP_Records.csv',
     'field': 'date_of_visit'},
    'condition_source_concept_id': {'table': 'GP_Records.csv',
     'field': 'comorbidity'},
    'condition_source_value': {'table': 'GP_Records.csv',
     'field': 'comorbidity'},
    'condition_start_datetime': {'table': 'GP_Records.csv',
     'field': 'date_of_visit'},
    'person_id': {'table': 'GP_Records.csv', 'field': 'ID'}},
   'original_file': '../data/part2/GP_Records.csv',
   'name': 'Ischemic_heart_disease_3049',
   'valid_person_id': {'before': 95, 'after': 95}},
  '0x10ac88a60': {'required_fields': {'person_id': {'before': 967,
     'after': 967},
    'condition_concept_id': {'before': 967,
     'after': 81,
     'after_formatting': 81},
    'condition_start_datetime': {'before': 81,
     'after': 81,
     'after_formatting': 81}},
   'source_files': {'condition_concept_id': {'table': 'GP_Records.csv',
     'field': 'comorbidity'},
    'condition_end_datetime': {'table': 'GP_Records.csv',
     'field': 'date_of_visit'},
    'condition_source_concept_id': {'table': 'GP_Records.csv',
     'field': 'comorbidity'},
    'condition_source_value': {'table': 'GP_Records.csv',
     'field': 'comorbidity'},
    'condition_start_datetime': {'table': 'GP_Records.csv',
     'field': 'date_of_visit'},
    'person_id': {'table': 'GP_Records.csv', 'field': 'ID'}},
   'original_file': '../data/part2/GP_Records.csv',
   'name': 'Hypertensive_disorder_3050',
   'valid_person_id': {'before': 81, 'after': 81}}},
 'drug_exposure': {'0x10aca01f0': {'required_fields': {'person_id': {'before': 360,
     'after': 360},
    'drug_concept_id': {'before': 360, 'after': 129, 'after_formatting': 129},
    'drug_exposure_start_datetime': {'before': 129,
     'after': 129,
     'after_formatting': 129}},
   'source_files': {'drug_concept_id': {'table': 'Vaccinations.csv',
     'field': 'type'},
    'drug_exposure_end_datetime': {'table': 'Vaccinations.csv',
     'field': 'date_of_vaccination'},
    'drug_exposure_start_datetime': {'table': 'Vaccinations.csv',
     'field': 'date_of_vaccination'},
    'drug_source_concept_id': {'table': 'Vaccinations.csv', 'field': 'type'},
    'drug_source_value': {'table': 'Vaccinations.csv', 'field': 'type'},
    'person_id': {'table': 'Vaccinations.csv', 'field': 'ID'}},
   'original_file': '../data/part2/Vaccinations.csv',
   'name': 'COVID_19_vaccine_3034',
   'valid_person_id': {'before': 129, 'after': 129}},
  '0x10aca0970': {'required_fields': {'person_id': {'before': 360,
     'after': 360},
    'drug_concept_id': {'before': 360, 'after': 114, 'after_formatting': 114},
    'drug_exposure_start_datetime': {'before': 114,
     'after': 114,
     'after_formatting': 114}},
   'source_files': {'drug_concept_id': {'table': 'Vaccinations.csv',
     'field': 'type'},
    'drug_exposure_end_datetime': {'table': 'Vaccinations.csv',
     'field': 'date_of_vaccination'},
    'drug_exposure_start_datetime': {'table': 'Vaccinations.csv',
     'field': 'date_of_vaccination'},
    'drug_source_concept_id': {'table': 'Vaccinations.csv', 'field': 'type'},
    'drug_source_value': {'table': 'Vaccinations.csv', 'field': 'type'},
    'person_id': {'table': 'Vaccinations.csv', 'field': 'ID'}},
   'original_file': '../data/part2/Vaccinations.csv',
   'name': 'COVID_19_vaccine_3035',
   'valid_person_id': {'before': 114, 'after': 112}},
  '0x10acccd30': {'required_fields': {'person_id': {'before': 360,
     'after': 360},
    'drug_concept_id': {'before': 360, 'after': 117, 'after_formatting': 117},
    'drug_exposure_start_datetime': {'before': 117,
     'after': 117,
     'after_formatting': 117}},
   'source_files': {'drug_concept_id': {'table': 'Vaccinations.csv',
     'field': 'type'},
    'drug_exposure_end_datetime': {'table': 'Vaccinations.csv',
     'field': 'date_of_vaccination'},
    'drug_exposure_start_datetime': {'table': 'Vaccinations.csv',
     'field': 'date_of_vaccination'},
    'drug_source_concept_id': {'table': 'Vaccinations.csv', 'field': 'type'},
    'drug_source_value': {'table': 'Vaccinations.csv', 'field': 'type'},
    'person_id': {'table': 'Vaccinations.csv', 'field': 'ID'}},
   'original_file': '../data/part2/Vaccinations.csv',
   'name': 'COVID_19_vaccine_3036',
   'valid_person_id': {'before': 117, 'after': 117}},
  '0x10acd6730': {'required_fields': {'person_id': {'before': 360,
     'after': 360},
    'drug_concept_id': {'before': 360, 'after': 129, 'after_formatting': 129},
    'drug_exposure_start_datetime': {'before': 129,
     'after': 129,
     'after_formatting': 129}},
   'source_files': {'drug_concept_id': {'table': 'Vaccinations.csv',
     'field': 'type'},
    'drug_exposure_end_datetime': {'table': 'Vaccinations.csv',
     'field': 'date_of_vaccination'},
    'drug_exposure_start_datetime': {'table': 'Vaccinations.csv',
     'field': 'date_of_vaccination'},
    'drug_source_concept_id': {'table': 'Vaccinations.csv', 'field': 'type'},
    'drug_source_value': {'table': 'Vaccinations.csv', 'field': 'type'},
    'person_id': {'table': 'Vaccinations.csv', 'field': 'ID'}},
   'original_file': '../data/part2/Vaccinations.csv',
   'name': 'SARS_CoV_2_COVID_19_vaccine_mRNA_1273_0_2_MG_ML_Injectable_Suspension_3040',
   'valid_person_id': {'before': 129, 'after': 129}},
  '0x10aca0610': {'required_fields': {'person_id': {'before': 360,
     'after': 360},
    'drug_concept_id': {'before': 360, 'after': 117, 'after_formatting': 117},
    'drug_exposure_start_datetime': {'before': 117,
     'after': 117,
     'after_formatting': 117}},
   'source_files': {'drug_concept_id': {'table': 'Vaccinations.csv',
     'field': 'type'},
    'drug_exposure_end_datetime': {'table': 'Vaccinations.csv',
     'field': 'date_of_vaccination'},
    'drug_exposure_start_datetime': {'table': 'Vaccinations.csv',
     'field': 'date_of_vaccination'},
    'drug_source_concept_id': {'table': 'Vaccinations.csv', 'field': 'type'},
    'drug_source_value': {'table': 'Vaccinations.csv', 'field': 'type'},
    'person_id': {'table': 'Vaccinations.csv', 'field': 'ID'}},
   'original_file': '../data/part2/Vaccinations.csv',
   'name': 'SARS_CoV_2_COVID_19_vaccine_mRNA_BNT162b2_0_1_MG_ML_Injectable_Suspension_3041',
   'valid_person_id': {'before': 117, 'after': 117}}}}
table = list(cdm.logs.values())[1]
data = [ obj['required_fields'] for obj in table.values()]
data
[{'gender_concept_id': {'before': 500, 'after': 284, 'after_formatting': 284},
  'birth_datetime': {'before': 284, 'after': 284, 'after_formatting': 284}},
 {'gender_concept_id': {'before': 500, 'after': 214, 'after_formatting': 214},
  'birth_datetime': {'before': 214, 'after': 214, 'after_formatting': 214}}]
person_id_map = cdm.person_id_masker
person_id_map
{'pk1002': 1,
 'pk1003': 2,
 'pk1004': 3,
 'pk1006': 4,
 'pk1009': 5,
 'pk1010': 6,
 'pk1011': 7,
 'pk1012': 8,
 'pk1013': 9,
 'pk1014': 10,
 'pk1016': 11,
 'pk1021': 12,
 'pk1022': 13,
 'pk1024': 14,
 'pk1025': 15,
 'pk1026': 16,
 'pk1027': 17,
 'pk1028': 18,
 'pk1029': 19,
 'pk1031': 20,
 'pk1032': 21,
 'pk1033': 22,
 'pk1035': 23,
 'pk1036': 24,
 'pk1038': 25,
 'pk1042': 26,
 'pk1044': 27,
 'pk1048': 28,
 'pk1049': 29,
 'pk1050': 30,
 'pk1051': 31,
 'pk1052': 32,
 'pk1053': 33,
 'pk1055': 34,
 'pk1056': 35,
 'pk1057': 36,
 'pk1059': 37,
 'pk1060': 38,
 'pk1064': 39,
 'pk1069': 40,
 'pk1071': 41,
 'pk1072': 42,
 'pk1073': 43,
 'pk1074': 44,
 'pk1075': 45,
 'pk1077': 46,
 'pk1085': 47,
 'pk1086': 48,
 'pk1088': 49,
 'pk1090': 50,
 'pk1091': 51,
 'pk1092': 52,
 'pk1094': 53,
 'pk1095': 54,
 'pk1096': 55,
 'pk1098': 56,
 'pk1099': 57,
 'pk1100': 58,
 'pk1102': 59,
 'pk1103': 60,
 'pk1104': 61,
 'pk1110': 62,
 'pk1112': 63,
 'pk1115': 64,
 'pk1117': 65,
 'pk1118': 66,
 'pk1119': 67,
 'pk1120': 68,
 'pk1122': 69,
 'pk1123': 70,
 'pk1125': 71,
 'pk1127': 72,
 'pk1128': 73,
 'pk1131': 74,
 'pk1132': 75,
 'pk1138': 76,
 'pk1140': 77,
 'pk1142': 78,
 'pk1145': 79,
 'pk1147': 80,
 'pk1148': 81,
 'pk1149': 82,
 'pk1150': 83,
 'pk1156': 84,
 'pk1157': 85,
 'pk1158': 86,
 'pk1160': 87,
 'pk1161': 88,
 'pk1167': 89,
 'pk1170': 90,
 'pk1172': 91,
 'pk1175': 92,
 'pk1176': 93,
 'pk1177': 94,
 'pk1180': 95,
 'pk1181': 96,
 'pk1185': 97,
 'pk1187': 98,
 'pk1188': 99,
 'pk1192': 100,
 'pk1194': 101,
 'pk1195': 102,
 'pk1196': 103,
 'pk1198': 104,
 'pk1200': 105,
 'pk1201': 106,
 'pk1202': 107,
 'pk1205': 108,
 'pk1207': 109,
 'pk1208': 110,
 'pk1210': 111,
 'pk1211': 112,
 'pk1216': 113,
 'pk1221': 114,
 'pk1222': 115,
 'pk1223': 116,
 'pk1226': 117,
 'pk1227': 118,
 'pk1229': 119,
 'pk1230': 120,
 'pk1231': 121,
 'pk1232': 122,
 'pk1233': 123,
 'pk1235': 124,
 'pk1236': 125,
 'pk1239': 126,
 'pk1240': 127,
 'pk1241': 128,
 'pk1242': 129,
 'pk1243': 130,
 'pk1244': 131,
 'pk1245': 132,
 'pk1246': 133,
 'pk1247': 134,
 'pk1249': 135,
 'pk1251': 136,
 'pk1253': 137,
 'pk1257': 138,
 'pk1258': 139,
 'pk1259': 140,
 'pk1260': 141,
 'pk1262': 142,
 'pk1264': 143,
 'pk1265': 144,
 'pk1266': 145,
 'pk1268': 146,
 'pk1271': 147,
 'pk1273': 148,
 'pk1274': 149,
 'pk1275': 150,
 'pk1277': 151,
 'pk1279': 152,
 'pk1283': 153,
 'pk1284': 154,
 'pk1285': 155,
 'pk1286': 156,
 'pk1287': 157,
 'pk1288': 158,
 'pk1290': 159,
 'pk1293': 160,
 'pk1294': 161,
 'pk1297': 162,
 'pk1299': 163,
 'pk1301': 164,
 'pk1302': 165,
 'pk1305': 166,
 'pk1308': 167,
 'pk1309': 168,
 'pk1310': 169,
 'pk1312': 170,
 'pk1313': 171,
 'pk1315': 172,
 'pk1318': 173,
 'pk1319': 174,
 'pk1320': 175,
 'pk1324': 176,
 'pk1325': 177,
 'pk1328': 178,
 'pk1329': 179,
 'pk1331': 180,
 'pk1332': 181,
 'pk1337': 182,
 'pk1338': 183,
 'pk1339': 184,
 'pk1340': 185,
 'pk1341': 186,
 'pk1343': 187,
 'pk1347': 188,
 'pk1350': 189,
 'pk1351': 190,
 'pk1352': 191,
 'pk1353': 192,
 'pk1357': 193,
 'pk1359': 194,
 'pk1360': 195,
 'pk1362': 196,
 'pk1364': 197,
 'pk1366': 198,
 'pk1367': 199,
 'pk1369': 200,
 'pk1370': 201,
 'pk1374': 202,
 'pk1375': 203,
 'pk1376': 204,
 'pk1378': 205,
 'pk1379': 206,
 'pk1380': 207,
 'pk1381': 208,
 'pk1382': 209,
 'pk1384': 210,
 'pk1385': 211,
 'pk1386': 212,
 'pk1387': 213,
 'pk1388': 214,
 'pk1390': 215,
 'pk1391': 216,
 'pk1392': 217,
 'pk1393': 218,
 'pk1396': 219,
 'pk1397': 220,
 'pk1398': 221,
 'pk1399': 222,
 'pk1400': 223,
 'pk1402': 224,
 'pk1403': 225,
 'pk1404': 226,
 'pk1406': 227,
 'pk1407': 228,
 'pk1409': 229,
 'pk1411': 230,
 'pk1412': 231,
 'pk1413': 232,
 'pk1414': 233,
 'pk1415': 234,
 'pk1417': 235,
 'pk1418': 236,
 'pk1419': 237,
 'pk1420': 238,
 'pk1425': 239,
 'pk1428': 240,
 'pk1430': 241,
 'pk1433': 242,
 'pk1435': 243,
 'pk1437': 244,
 'pk1438': 245,
 'pk1440': 246,
 'pk1441': 247,
 'pk1443': 248,
 'pk1444': 249,
 'pk1445': 250,
 'pk1447': 251,
 'pk1448': 252,
 'pk1449': 253,
 'pk1450': 254,
 'pk1451': 255,
 'pk1453': 256,
 'pk1454': 257,
 'pk1455': 258,
 'pk1456': 259,
 'pk1458': 260,
 'pk1459': 261,
 'pk1464': 262,
 'pk1466': 263,
 'pk1469': 264,
 'pk1471': 265,
 'pk1472': 266,
 'pk1473': 267,
 'pk1474': 268,
 'pk1475': 269,
 'pk1476': 270,
 'pk1477': 271,
 'pk1479': 272,
 'pk1481': 273,
 'pk1484': 274,
 'pk1485': 275,
 'pk1486': 276,
 'pk1488': 277,
 'pk1489': 278,
 'pk1490': 279,
 'pk1491': 280,
 'pk1492': 281,
 'pk1494': 282,
 'pk1496': 283,
 'pk1497': 284,
 'pk1001': 285,
 'pk1005': 286,
 'pk1007': 287,
 'pk1008': 288,
 'pk1015': 289,
 'pk1017': 290,
 'pk1018': 291,
 'pk1019': 292,
 'pk1020': 293,
 'pk1030': 294,
 'pk1034': 295,
 'pk1037': 296,
 'pk1039': 297,
 'pk1040': 298,
 'pk1041': 299,
 'pk1043': 300,
 'pk1045': 301,
 'pk1046': 302,
 'pk1047': 303,
 'pk1054': 304,
 'pk1058': 305,
 'pk1061': 306,
 'pk1062': 307,
 'pk1063': 308,
 'pk1065': 309,
 'pk1066': 310,
 'pk1067': 311,
 'pk1068': 312,
 'pk1070': 313,
 'pk1076': 314,
 'pk1078': 315,
 'pk1079': 316,
 'pk1080': 317,
 'pk1081': 318,
 'pk1082': 319,
 'pk1083': 320,
 'pk1084': 321,
 'pk1087': 322,
 'pk1089': 323,
 'pk1093': 324,
 'pk1097': 325,
 'pk1101': 326,
 'pk1105': 327,
 'pk1106': 328,
 'pk1107': 329,
 'pk1108': 330,
 'pk1109': 331,
 'pk1111': 332,
 'pk1113': 333,
 'pk1114': 334,
 'pk1116': 335,
 'pk1121': 336,
 'pk1124': 337,
 'pk1126': 338,
 'pk1129': 339,
 'pk1130': 340,
 'pk1133': 341,
 'pk1134': 342,
 'pk1135': 343,
 'pk1136': 344,
 'pk1137': 345,
 'pk1139': 346,
 'pk1141': 347,
 'pk1143': 348,
 'pk1144': 349,
 'pk1146': 350,
 'pk1151': 351,
 'pk1152': 352,
 'pk1153': 353,
 'pk1154': 354,
 'pk1155': 355,
 'pk1159': 356,
 'pk1162': 357,
 'pk1163': 358,
 'pk1164': 359,
 'pk1165': 360,
 'pk1166': 361,
 'pk1168': 362,
 'pk1169': 363,
 'pk1171': 364,
 'pk1173': 365,
 'pk1174': 366,
 'pk1178': 367,
 'pk1179': 368,
 'pk1182': 369,
 'pk1183': 370,
 'pk1184': 371,
 'pk1186': 372,
 'pk1189': 373,
 'pk1190': 374,
 'pk1191': 375,
 'pk1193': 376,
 'pk1199': 377,
 'pk1203': 378,
 'pk1204': 379,
 'pk1206': 380,
 'pk1209': 381,
 'pk1212': 382,
 'pk1213': 383,
 'pk1214': 384,
 'pk1215': 385,
 'pk1217': 386,
 'pk1218': 387,
 'pk1219': 388,
 'pk1220': 389,
 'pk1224': 390,
 'pk1225': 391,
 'pk1228': 392,
 'pk1234': 393,
 'pk1237': 394,
 'pk1238': 395,
 'pk1248': 396,
 'pk1250': 397,
 'pk1252': 398,
 'pk1254': 399,
 'pk1255': 400,
 'pk1256': 401,
 'pk1261': 402,
 'pk1263': 403,
 'pk1267': 404,
 'pk1269': 405,
 'pk1270': 406,
 'pk1272': 407,
 'pk1276': 408,
 'pk1278': 409,
 'pk1280': 410,
 'pk1281': 411,
 'pk1282': 412,
 'pk1289': 413,
 'pk1291': 414,
 'pk1292': 415,
 'pk1295': 416,
 'pk1296': 417,
 'pk1298': 418,
 'pk1300': 419,
 'pk1303': 420,
 'pk1304': 421,
 'pk1306': 422,
 'pk1307': 423,
 'pk1311': 424,
 'pk1314': 425,
 'pk1316': 426,
 'pk1317': 427,
 'pk1321': 428,
 'pk1322': 429,
 'pk1323': 430,
 'pk1326': 431,
 'pk1327': 432,
 'pk1330': 433,
 'pk1333': 434,
 'pk1334': 435,
 'pk1335': 436,
 'pk1336': 437,
 'pk1342': 438,
 'pk1344': 439,
 'pk1345': 440,
 'pk1346': 441,
 'pk1348': 442,
 'pk1349': 443,
 'pk1354': 444,
 'pk1355': 445,
 'pk1356': 446,
 'pk1358': 447,
 'pk1361': 448,
 'pk1363': 449,
 'pk1365': 450,
 'pk1368': 451,
 'pk1371': 452,
 'pk1372': 453,
 'pk1373': 454,
 'pk1377': 455,
 'pk1383': 456,
 'pk1389': 457,
 'pk1394': 458,
 'pk1395': 459,
 'pk1401': 460,
 'pk1405': 461,
 'pk1408': 462,
 'pk1410': 463,
 'pk1416': 464,
 'pk1421': 465,
 'pk1422': 466,
 'pk1423': 467,
 'pk1424': 468,
 'pk1426': 469,
 'pk1427': 470,
 'pk1429': 471,
 'pk1431': 472,
 'pk1432': 473,
 'pk1434': 474,
 'pk1436': 475,
 'pk1439': 476,
 'pk1442': 477,
 'pk1446': 478,
 'pk1452': 479,
 'pk1457': 480,
 'pk1460': 481,
 'pk1461': 482,
 'pk1462': 483,
 'pk1463': 484,
 'pk1465': 485,
 'pk1467': 486,
 'pk1468': 487,
 'pk1470': 488,
 'pk1478': 489,
 'pk1480': 490,
 'pk1482': 491,
 'pk1483': 492,
 'pk1487': 493,
 'pk1493': 494,
 'pk1495': 495,
 'pk1498': 496,
 'pk1499': 497,
 'pk1500': 498}

Reset and change

Reset the CDM - remove created dataframes and removing any indexing

cdm.reset()
2022-06-17 14:49:22 - LocalDataCollection - INFO - resetting used bricks

Change the default index start value for observation

cdm.set_indexing_map({'observation':10000000000})
cdm.process()
2022-06-17 14:49:22 - CommonDataModel - INFO - Starting processing in order: ['person', 'observation', 'condition_occurrence', 'drug_exposure']
2022-06-17 14:49:22 - CommonDataModel - INFO - Number of objects to process for each table...
{
      "person": 2,
      "observation": 4,
      "condition_occurrence": 12,
      "drug_exposure": 5
}
2022-06-17 14:49:22 - CommonDataModel - INFO - for person: found 2 objects
2022-06-17 14:49:22 - CommonDataModel - INFO - working on person
2022-06-17 14:49:22 - CommonDataModel - INFO - starting on MALE 3025
2022-06-17 14:49:22 - CommonDataModel - WARNING - {'observation': 10000000000}
2022-06-17 14:49:22 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (person) has not be passed, so starting from 1
2022-06-17 14:49:22 - Person - INFO - Called apply_rules
2022-06-17 14:49:22 - LocalDataCollection - INFO - Retrieving initial dataframe for 'Demographics.csv' for the first time
2022-06-17 14:49:22 - Person - INFO - Mapped birth_datetime
2022-06-17 14:49:22 - Person - INFO - Mapped gender_concept_id
2022-06-17 14:49:22 - Person - INFO - Mapped gender_source_concept_id
2022-06-17 14:49:22 - Person - INFO - Mapped gender_source_value
2022-06-17 14:49:22 - Person - INFO - Mapped person_id
2022-06-17 14:49:22 - Person - WARNING - Requiring non-null values in gender_concept_id removed 216 rows, leaving 284 rows.
2022-06-17 14:49:22 - Person - INFO - Automatically formatting data columns.
2022-06-17 14:49:22 - Person - INFO - created df (0x10ac2ed60)[MALE_3025]
2022-06-17 14:49:22 - CommonDataModel - INFO - finished MALE 3025 (0x10ac2ed60) ... 1/2 completed, 284 rows
2022-06-17 14:49:22 - CommonDataModel - WARNING - {'observation': 10000000000}
2022-06-17 14:49:22 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (person) has not be passed, so starting from 1
2022-06-17 14:49:22 - LocalDataCollection - INFO - saving person_ids to ./test_outputs/local//person_ids.csv
2022-06-17 14:49:22 - LocalDataCollection - INFO - finished save to file
2022-06-17 14:49:22 - CommonDataModel - INFO - starting on FEMALE 3026
2022-06-17 14:49:22 - CommonDataModel - WARNING - {'observation': 10000000000}
2022-06-17 14:49:22 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (person) has not be passed, so starting from 1
2022-06-17 14:49:22 - Person - INFO - Called apply_rules
2022-06-17 14:49:23 - Person - INFO - Mapped birth_datetime
2022-06-17 14:49:23 - Person - INFO - Mapped gender_concept_id
2022-06-17 14:49:23 - Person - INFO - Mapped gender_source_concept_id
2022-06-17 14:49:23 - Person - INFO - Mapped gender_source_value
2022-06-17 14:49:23 - Person - INFO - Mapped person_id
2022-06-17 14:49:23 - Person - WARNING - Requiring non-null values in gender_concept_id removed 286 rows, leaving 214 rows.
2022-06-17 14:49:23 - Person - INFO - Automatically formatting data columns.
2022-06-17 14:49:23 - Person - INFO - created df (0x10ac244f0)[FEMALE_3026]
2022-06-17 14:49:23 - CommonDataModel - INFO - finished FEMALE 3026 (0x10ac244f0) ... 2/2 completed, 214 rows
2022-06-17 14:49:23 - LocalDataCollection - INFO - updating person_ids in ./test_outputs/local//person_ids.csv
2022-06-17 14:49:23 - LocalDataCollection - INFO - finished save to file
2022-06-17 14:49:23 - CommonDataModel - INFO - saving dataframe (0x10ac24eb0) to <carrot.io.plugins.local.LocalDataCollection object at 0x10a847a00>
2022-06-17 14:49:23 - LocalDataCollection - INFO - saving person to ./test_outputs/local//person.csv
2022-06-17 14:49:23 - LocalDataCollection - INFO - finished save to file
2022-06-17 14:49:23 - CommonDataModel - INFO - finalised person on iteration 0 producing 498 rows from 2 tables
2022-06-17 14:49:23 - LocalDataCollection - INFO - Getting next chunk of data
2022-06-17 14:49:23 - LocalDataCollection - INFO - All input files for this object have now been used.
2022-06-17 14:49:23 - LocalDataCollection - INFO - resetting used bricks
2022-06-17 14:49:23 - CommonDataModel - INFO - for observation: found 4 objects
2022-06-17 14:49:23 - CommonDataModel - INFO - working on observation
2022-06-17 14:49:23 - CommonDataModel - INFO - starting on Antibody 3027
2022-06-17 14:49:23 - Observation - INFO - Called apply_rules
2022-06-17 14:49:23 - LocalDataCollection - INFO - Retrieving initial dataframe for 'Serology.csv' for the first time
2022-06-17 14:49:23 - Observation - INFO - Mapped observation_concept_id
2022-06-17 14:49:23 - Observation - INFO - Mapped observation_datetime
2022-06-17 14:49:23 - Observation - INFO - Mapped observation_source_concept_id
2022-06-17 14:49:23 - Observation - INFO - Mapped observation_source_value
2022-06-17 14:49:23 - Observation - INFO - Mapped person_id
2022-06-17 14:49:23 - Observation - INFO - Automatically formatting data columns.
2022-06-17 14:49:23 - Observation - INFO - created df (0x10ac71880)[Antibody_3027]
2022-06-17 14:49:23 - CommonDataModel - INFO - finished Antibody 3027 (0x10ac71880) ... 1/4 completed, 204 rows
2022-06-17 14:49:23 - CommonDataModel - INFO - starting on H/O: heart failure 3043
2022-06-17 14:49:23 - Observation - INFO - Called apply_rules
2022-06-17 14:49:23 - LocalDataCollection - INFO - Retrieving initial dataframe for 'Hospital_Visit.csv' for the first time
2022-06-17 14:49:23 - Observation - INFO - Mapped observation_concept_id
2022-06-17 14:49:23 - Observation - INFO - Mapped observation_datetime
2022-06-17 14:49:23 - Observation - INFO - Mapped observation_source_concept_id
2022-06-17 14:49:23 - Observation - INFO - Mapped observation_source_value
2022-06-17 14:49:23 - Observation - INFO - Mapped person_id
2022-06-17 14:49:23 - Observation - WARNING - Requiring non-null values in observation_concept_id removed 487 rows, leaving 113 rows.
2022-06-17 14:49:23 - Observation - INFO - Automatically formatting data columns.
2022-06-17 14:49:23 - Observation - INFO - created df (0x10abb9820)[H_O_heart_failure_3043]
2022-06-17 14:49:23 - CommonDataModel - INFO - finished H/O: heart failure 3043 (0x10abb9820) ... 2/4 completed, 113 rows
2022-06-17 14:49:23 - CommonDataModel - INFO - starting on 2019-nCoV 3044

2022-06-17 14:49:23 - Observation - INFO - Called apply_rules
2022-06-17 14:49:23 - Observation - INFO - Mapped observation_concept_id
2022-06-17 14:49:23 - Observation - INFO - Mapped observation_datetime
2022-06-17 14:49:23 - Observation - INFO - Mapped observation_source_concept_id
2022-06-17 14:49:23 - Observation - INFO - Mapped observation_source_value
2022-06-17 14:49:23 - Observation - INFO - Mapped person_id
2022-06-17 14:49:23 - Observation - WARNING - Requiring non-null values in observation_concept_id removed 506 rows, leaving 94 rows.
2022-06-17 14:49:23 - Observation - INFO - Automatically formatting data columns.
2022-06-17 14:49:23 - Observation - INFO - created df (0x10aca08e0)[2019_nCoV_3044]
2022-06-17 14:49:23 - CommonDataModel - INFO - finished 2019-nCoV 3044 (0x10aca08e0) ... 3/4 completed, 94 rows
2022-06-17 14:49:24 - CommonDataModel - INFO - starting on Cancer 3045
2022-06-17 14:49:24 - Observation - INFO - Called apply_rules
2022-06-17 14:49:24 - Observation - INFO - Mapped observation_concept_id
2022-06-17 14:49:24 - Observation - INFO - Mapped observation_datetime
2022-06-17 14:49:24 - Observation - INFO - Mapped observation_source_concept_id
2022-06-17 14:49:24 - Observation - INFO - Mapped observation_source_value
2022-06-17 14:49:24 - Observation - INFO - Mapped person_id
2022-06-17 14:49:24 - Observation - WARNING - Requiring non-null values in observation_concept_id removed 426 rows, leaving 174 rows.
2022-06-17 14:49:24 - Observation - INFO - Automatically formatting data columns.
2022-06-17 14:49:24 - Observation - INFO - created df (0x10aba89a0)[Cancer_3045]
2022-06-17 14:49:24 - CommonDataModel - INFO - finished Cancer 3045 (0x10aba89a0) ... 4/4 completed, 174 rows
2022-06-17 14:49:24 - CommonDataModel - INFO - saving dataframe (0x10a8f3670) to <carrot.io.plugins.local.LocalDataCollection object at 0x10a847a00>
2022-06-17 14:49:24 - LocalDataCollection - INFO - saving observation to ./test_outputs/local//observation.csv
2022-06-17 14:49:24 - LocalDataCollection - INFO - finished save to file
2022-06-17 14:49:24 - CommonDataModel - INFO - finalised observation on iteration 0 producing 585 rows from 4 tables
2022-06-17 14:49:24 - LocalDataCollection - INFO - Getting next chunk of data
2022-06-17 14:49:24 - LocalDataCollection - INFO - All input files for this object have now been used.
2022-06-17 14:49:24 - LocalDataCollection - INFO - resetting used bricks
2022-06-17 14:49:24 - CommonDataModel - INFO - for condition_occurrence: found 12 objects
2022-06-17 14:49:24 - CommonDataModel - INFO - working on condition_occurrence
2022-06-17 14:49:24 - CommonDataModel - INFO - starting on Headache 3028
2022-06-17 14:49:24 - CommonDataModel - WARNING - {'observation': 10000000000}
2022-06-17 14:49:24 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (condition_occurrence) has not be passed, so starting from 1
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:24 - LocalDataCollection - INFO - Retrieving initial dataframe for 'Symptoms.csv' for the first time
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:24 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 39 rows, leaving 126 rows.
2022-06-17 14:49:24 - ConditionOccurrence - WARNING - Requiring non-null values in condition_start_datetime removed 1 rows, leaving 125 rows.
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:24 - ConditionOccurrence - INFO - created df (0x10ac2e490)[Headache_3028]
2022-06-17 14:49:24 - CommonDataModel - INFO - finished Headache 3028 (0x10ac2e490) ... 1/12 completed, 125 rows
2022-06-17 14:49:24 - CommonDataModel - ERROR - There are person_ids in this table that are not in the output person table!
2022-06-17 14:49:24 - CommonDataModel - ERROR - Either they are not in the original data, or while creating the person table, 
2022-06-17 14:49:24 - CommonDataModel - ERROR - studies have been removed due to lack of required fields, such as birthdate.
2022-06-17 14:49:24 - CommonDataModel - ERROR - 123/125 were good, 2 studies are removed.
2022-06-17 14:49:24 - CommonDataModel - INFO - starting on Fatigue 3029
2022-06-17 14:49:24 - CommonDataModel - WARNING - {'observation': 10000000000}
2022-06-17 14:49:24 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (condition_occurrence) has not be passed, so starting from 1
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:24 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 52 rows, leaving 113 rows.
2022-06-17 14:49:24 - ConditionOccurrence - WARNING - Requiring non-null values in condition_start_datetime removed 1 rows, leaving 112 rows.
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:24 - ConditionOccurrence - INFO - created df (0x10a88fee0)[Fatigue_3029]
2022-06-17 14:49:24 - CommonDataModel - INFO - finished Fatigue 3029 (0x10a88fee0) ... 2/12 completed, 112 rows
2022-06-17 14:49:24 - CommonDataModel - ERROR - There are person_ids in this table that are not in the output person table!

2022-06-17 14:49:24 - CommonDataModel - ERROR - Either they are not in the original data, or while creating the person table, 
2022-06-17 14:49:24 - CommonDataModel - ERROR - studies have been removed due to lack of required fields, such as birthdate.
2022-06-17 14:49:24 - CommonDataModel - ERROR - 110/112 were good, 2 studies are removed.
2022-06-17 14:49:24 - CommonDataModel - INFO - starting on Dizziness 3030
2022-06-17 14:49:24 - CommonDataModel - WARNING - {'observation': 10000000000}
2022-06-17 14:49:24 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (condition_occurrence) has not be passed, so starting from 1
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:24 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 88 rows, leaving 77 rows.
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:24 - ConditionOccurrence - INFO - created df (0x10ac246d0)[Dizziness_3030]
2022-06-17 14:49:24 - CommonDataModel - INFO - finished Dizziness 3030 (0x10ac246d0) ... 3/12 completed, 77 rows
2022-06-17 14:49:24 - CommonDataModel - INFO - starting on Cough 3031
2022-06-17 14:49:24 - CommonDataModel - WARNING - {'observation': 10000000000}
2022-06-17 14:49:24 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (condition_occurrence) has not be passed, so starting from 1
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:24 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 60 rows, leaving 105 rows.
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:24 - ConditionOccurrence - INFO - created df (0x10abaa3d0)[Cough_3031]
2022-06-17 14:49:24 - CommonDataModel - INFO - finished Cough 3031 (0x10abaa3d0) ... 4/12 completed, 105 rows
2022-06-17 14:49:24 - CommonDataModel - ERROR - There are person_ids in this table that are not in the output person table!
2022-06-17 14:49:24 - CommonDataModel - ERROR - Either they are not in the original data, or while creating the person table, 
2022-06-17 14:49:24 - CommonDataModel - ERROR - studies have been removed due to lack of required fields, such as birthdate.
2022-06-17 14:49:24 - CommonDataModel - ERROR - 103/105 were good, 2 studies are removed.
2022-06-17 14:49:24 - CommonDataModel - INFO - starting on Fever 3032
2022-06-17 14:49:24 - CommonDataModel - WARNING - {'observation': 10000000000}
2022-06-17 14:49:24 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (condition_occurrence) has not be passed, so starting from 1
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:24 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:24 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 128 rows, leaving 37 rows.
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:25 - ConditionOccurrence - INFO - created df (0x10a88f700)[Fever_3032]
2022-06-17 14:49:25 - CommonDataModel - INFO - finished Fever 3032 (0x10a88f700) ... 5/12 completed, 37 rows
2022-06-17 14:49:25 - CommonDataModel - INFO - starting on Muscle pain 3033
2022-06-17 14:49:25 - CommonDataModel - WARNING - {'observation': 10000000000}
2022-06-17 14:49:25 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (condition_occurrence) has not be passed, so starting from 1
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:25 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 147 rows, leaving 18 rows.
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:25 - ConditionOccurrence - INFO - created df (0x10abc4be0)[Muscle_pain_3033]
2022-06-17 14:49:25 - CommonDataModel - INFO - finished Muscle pain 3033 (0x10abc4be0) ... 6/12 completed, 18 rows
2022-06-17 14:49:25 - CommonDataModel - INFO - starting on Pneumonia 3042
2022-06-17 14:49:25 - CommonDataModel - WARNING - {'observation': 10000000000}

2022-06-17 14:49:25 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (condition_occurrence) has not be passed, so starting from 1
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:25 - LocalDataCollection - INFO - Retrieving initial dataframe for 'Hospital_Visit.csv' for the first time
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:25 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 509 rows, leaving 91 rows.
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:25 - ConditionOccurrence - INFO - created df (0x10ac06fd0)[Pneumonia_3042]
2022-06-17 14:49:25 - CommonDataModel - INFO - finished Pneumonia 3042 (0x10ac06fd0) ... 7/12 completed, 91 rows
2022-06-17 14:49:25 - CommonDataModel - ERROR - There are person_ids in this table that are not in the output person table!
2022-06-17 14:49:25 - CommonDataModel - ERROR - Either they are not in the original data, or while creating the person table, 
2022-06-17 14:49:25 - CommonDataModel - ERROR - studies have been removed due to lack of required fields, such as birthdate.
2022-06-17 14:49:25 - CommonDataModel - ERROR - 90/91 were good, 1 studies are removed.
2022-06-17 14:49:25 - CommonDataModel - INFO - starting on Mental health problem 3046
2022-06-17 14:49:25 - CommonDataModel - WARNING - {'observation': 10000000000}
2022-06-17 14:49:25 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (condition_occurrence) has not be passed, so starting from 1
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:25 - LocalDataCollection - INFO - Retrieving initial dataframe for 'GP_Records.csv' for the first time
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:25 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 742 rows, leaving 225 rows.
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:25 - ConditionOccurrence - INFO - created df (0x10a88f9d0)[Mental_health_problem_3046]
2022-06-17 14:49:25 - CommonDataModel - INFO - finished Mental health problem 3046 (0x10a88f9d0) ... 8/12 completed, 225 rows
2022-06-17 14:49:25 - CommonDataModel - ERROR - There are person_ids in this table that are not in the output person table!
2022-06-17 14:49:25 - CommonDataModel - ERROR - Either they are not in the original data, or while creating the person table, 
2022-06-17 14:49:25 - CommonDataModel - ERROR - studies have been removed due to lack of required fields, such as birthdate.
2022-06-17 14:49:25 - CommonDataModel - ERROR - 224/225 were good, 1 studies are removed.
2022-06-17 14:49:25 - CommonDataModel - INFO - starting on Mental disorder 3047
2022-06-17 14:49:25 - CommonDataModel - WARNING - {'observation': 10000000000}
2022-06-17 14:49:25 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (condition_occurrence) has not be passed, so starting from 1
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:25 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 742 rows, leaving 225 rows.
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:25 - ConditionOccurrence - INFO - created df (0x10abf67f0)[Mental_disorder_3047]
2022-06-17 14:49:25 - CommonDataModel - INFO - finished Mental disorder 3047 (0x10abf67f0) ... 9/12 completed, 225 rows
2022-06-17 14:49:25 - CommonDataModel - ERROR - There are person_ids in this table that are not in the output person table!
2022-06-17 14:49:25 - CommonDataModel - ERROR - Either they are not in the original data, or while creating the person table, 
2022-06-17 14:49:25 - CommonDataModel - ERROR - studies have been removed due to lack of required fields, such as birthdate.
2022-06-17 14:49:25 - CommonDataModel - ERROR - 224/225 were good, 1 studies are removed.
2022-06-17 14:49:25 - CommonDataModel - INFO - starting on Type 2 diabetes mellitus 3048
2022-06-17 14:49:25 - CommonDataModel - WARNING - {'observation': 10000000000}
2022-06-17 14:49:25 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (condition_occurrence) has not be passed, so starting from 1
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_start_datetime

2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:25 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 851 rows, leaving 116 rows.
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:25 - ConditionOccurrence - INFO - created df (0x10aba3910)[Type_2_diabetes_mellitus_3048]
2022-06-17 14:49:25 - CommonDataModel - INFO - finished Type 2 diabetes mellitus 3048 (0x10aba3910) ... 10/12 completed, 116 rows
2022-06-17 14:49:25 - CommonDataModel - INFO - starting on Ischemic heart disease 3049
2022-06-17 14:49:25 - CommonDataModel - WARNING - {'observation': 10000000000}
2022-06-17 14:49:25 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (condition_occurrence) has not be passed, so starting from 1
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:25 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 872 rows, leaving 95 rows.
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:25 - ConditionOccurrence - INFO - created df (0x10abc49d0)[Ischemic_heart_disease_3049]
2022-06-17 14:49:25 - CommonDataModel - INFO - finished Ischemic heart disease 3049 (0x10abc49d0) ... 11/12 completed, 95 rows
2022-06-17 14:49:25 - CommonDataModel - INFO - starting on Hypertensive disorder 3050
2022-06-17 14:49:25 - CommonDataModel - WARNING - {'observation': 10000000000}
2022-06-17 14:49:25 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (condition_occurrence) has not be passed, so starting from 1
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Called apply_rules
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_concept_id
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_end_datetime
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_source_concept_id
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_source_value
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped condition_start_datetime
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Mapped person_id
2022-06-17 14:49:25 - ConditionOccurrence - WARNING - Requiring non-null values in condition_concept_id removed 886 rows, leaving 81 rows.
2022-06-17 14:49:25 - ConditionOccurrence - INFO - Automatically formatting data columns.
2022-06-17 14:49:25 - ConditionOccurrence - INFO - created df (0x10a919040)[Hypertensive_disorder_3050]
2022-06-17 14:49:25 - CommonDataModel - INFO - finished Hypertensive disorder 3050 (0x10a919040) ... 12/12 completed, 81 rows
2022-06-17 14:49:26 - CommonDataModel - INFO - saving dataframe (0x10acbb3d0) to <carrot.io.plugins.local.LocalDataCollection object at 0x10a847a00>
2022-06-17 14:49:26 - LocalDataCollection - INFO - saving condition_occurrence to ./test_outputs/local//condition_occurrence.csv
2022-06-17 14:49:26 - LocalDataCollection - INFO - finished save to file
2022-06-17 14:49:26 - CommonDataModel - INFO - finalised condition_occurrence on iteration 0 producing 1298 rows from 12 tables
2022-06-17 14:49:26 - LocalDataCollection - INFO - Getting next chunk of data
2022-06-17 14:49:26 - LocalDataCollection - INFO - All input files for this object have now been used.
2022-06-17 14:49:26 - LocalDataCollection - INFO - resetting used bricks
2022-06-17 14:49:26 - CommonDataModel - INFO - for drug_exposure: found 5 objects
2022-06-17 14:49:26 - CommonDataModel - INFO - working on drug_exposure
2022-06-17 14:49:26 - CommonDataModel - INFO - starting on COVID-19 vaccine 3034
2022-06-17 14:49:26 - CommonDataModel - WARNING - {'observation': 10000000000}
2022-06-17 14:49:26 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (drug_exposure) has not be passed, so starting from 1
2022-06-17 14:49:26 - DrugExposure - INFO - Called apply_rules
2022-06-17 14:49:26 - LocalDataCollection - INFO - Retrieving initial dataframe for 'Vaccinations.csv' for the first time
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_concept_id
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_exposure_end_datetime
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_exposure_start_datetime
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_source_concept_id
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_source_value
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped person_id
2022-06-17 14:49:26 - DrugExposure - WARNING - Requiring non-null values in drug_concept_id removed 231 rows, leaving 129 rows.
2022-06-17 14:49:26 - DrugExposure - INFO - Automatically formatting data columns.
2022-06-17 14:49:26 - DrugExposure - INFO - created df (0x10ace3550)[COVID_19_vaccine_3034]
2022-06-17 14:49:26 - CommonDataModel - INFO - finished COVID-19 vaccine 3034 (0x10ace3550) ... 1/5 completed, 129 rows
2022-06-17 14:49:26 - CommonDataModel - INFO - starting on COVID-19 vaccine 3035
2022-06-17 14:49:26 - CommonDataModel - WARNING - {'observation': 10000000000}
2022-06-17 14:49:26 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (drug_exposure) has not be passed, so starting from 1
2022-06-17 14:49:26 - DrugExposure - INFO - Called apply_rules
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_concept_id
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_exposure_end_datetime
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_exposure_start_datetime
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_source_concept_id
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_source_value

2022-06-17 14:49:26 - DrugExposure - INFO - Mapped person_id
2022-06-17 14:49:26 - DrugExposure - WARNING - Requiring non-null values in drug_concept_id removed 246 rows, leaving 114 rows.
2022-06-17 14:49:26 - DrugExposure - INFO - Automatically formatting data columns.
2022-06-17 14:49:26 - DrugExposure - INFO - created df (0x10ace3eb0)[COVID_19_vaccine_3035]
2022-06-17 14:49:26 - CommonDataModel - INFO - finished COVID-19 vaccine 3035 (0x10ace3eb0) ... 2/5 completed, 114 rows
2022-06-17 14:49:26 - CommonDataModel - ERROR - There are person_ids in this table that are not in the output person table!
2022-06-17 14:49:26 - CommonDataModel - ERROR - Either they are not in the original data, or while creating the person table, 
2022-06-17 14:49:26 - CommonDataModel - ERROR - studies have been removed due to lack of required fields, such as birthdate.
2022-06-17 14:49:26 - CommonDataModel - ERROR - 112/114 were good, 2 studies are removed.
2022-06-17 14:49:26 - CommonDataModel - INFO - starting on COVID-19 vaccine 3036
2022-06-17 14:49:26 - CommonDataModel - WARNING - {'observation': 10000000000}
2022-06-17 14:49:26 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (drug_exposure) has not be passed, so starting from 1
2022-06-17 14:49:26 - DrugExposure - INFO - Called apply_rules
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_concept_id
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_exposure_end_datetime
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_exposure_start_datetime
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_source_concept_id
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_source_value
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped person_id
2022-06-17 14:49:26 - DrugExposure - WARNING - Requiring non-null values in drug_concept_id removed 243 rows, leaving 117 rows.
2022-06-17 14:49:26 - DrugExposure - INFO - Automatically formatting data columns.
2022-06-17 14:49:26 - DrugExposure - INFO - created df (0x10accca00)[COVID_19_vaccine_3036]
2022-06-17 14:49:26 - CommonDataModel - INFO - finished COVID-19 vaccine 3036 (0x10accca00) ... 3/5 completed, 117 rows
2022-06-17 14:49:26 - CommonDataModel - INFO - starting on SARS-CoV-2 (COVID-19) vaccine, mRNA-1273 0.2 MG/ML Injectable Suspension 3040
2022-06-17 14:49:26 - CommonDataModel - WARNING - {'observation': 10000000000}
2022-06-17 14:49:26 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (drug_exposure) has not be passed, so starting from 1
2022-06-17 14:49:26 - DrugExposure - INFO - Called apply_rules
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_concept_id
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_exposure_end_datetime
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_exposure_start_datetime
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_source_concept_id
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_source_value
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped person_id
2022-06-17 14:49:26 - DrugExposure - WARNING - Requiring non-null values in drug_concept_id removed 231 rows, leaving 129 rows.
2022-06-17 14:49:26 - DrugExposure - INFO - Automatically formatting data columns.
2022-06-17 14:49:26 - DrugExposure - INFO - created df (0x10acd67c0)[SARS_CoV_2_COVID_19_vaccine_mRNA_1273_0_2_MG_ML_Injectable_Suspension_3040]
2022-06-17 14:49:26 - CommonDataModel - INFO - finished SARS-CoV-2 (COVID-19) vaccine, mRNA-1273 0.2 MG/ML Injectable Suspension 3040 (0x10acd67c0) ... 4/5 completed, 129 rows
2022-06-17 14:49:26 - CommonDataModel - INFO - starting on SARS-CoV-2 (COVID-19) vaccine, mRNA-BNT162b2 0.1 MG/ML Injectable Suspension 3041
2022-06-17 14:49:26 - CommonDataModel - WARNING - {'observation': 10000000000}
2022-06-17 14:49:26 - CommonDataModel - WARNING - indexing configuration has be parsed but this table (drug_exposure) has not be passed, so starting from 1
2022-06-17 14:49:26 - DrugExposure - INFO - Called apply_rules
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_concept_id
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_exposure_end_datetime
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_exposure_start_datetime
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_source_concept_id
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped drug_source_value
2022-06-17 14:49:26 - DrugExposure - INFO - Mapped person_id
2022-06-17 14:49:26 - DrugExposure - WARNING - Requiring non-null values in drug_concept_id removed 243 rows, leaving 117 rows.
2022-06-17 14:49:26 - DrugExposure - INFO - Automatically formatting data columns.
2022-06-17 14:49:26 - DrugExposure - INFO - created df (0x10ad37bb0)[SARS_CoV_2_COVID_19_vaccine_mRNA_BNT162b2_0_1_MG_ML_Injectable_Suspension_3041]
2022-06-17 14:49:26 - CommonDataModel - INFO - finished SARS-CoV-2 (COVID-19) vaccine, mRNA-BNT162b2 0.1 MG/ML Injectable Suspension 3041 (0x10ad37bb0) ... 5/5 completed, 117 rows
2022-06-17 14:49:27 - CommonDataModel - INFO - saving dataframe (0x10a8a6130) to <carrot.io.plugins.local.LocalDataCollection object at 0x10a847a00>
2022-06-17 14:49:27 - LocalDataCollection - INFO - saving drug_exposure to ./test_outputs/local//drug_exposure.csv
2022-06-17 14:49:27 - LocalDataCollection - INFO - finished save to file
2022-06-17 14:49:27 - CommonDataModel - INFO - finalised drug_exposure on iteration 0 producing 604 rows from 5 tables
2022-06-17 14:49:27 - LocalDataCollection - INFO - Getting next chunk of data
2022-06-17 14:49:27 - LocalDataCollection - INFO - All input files for this object have now been used.

cdm['observation'].dropna(axis=1)
person_id observation_concept_id observation_date observation_datetime observation_source_value observation_source_concept_id
observation_id
10000000585 458 4288455 2020-11-20 2020-11-20 00:00:00.000000 61.597638175407624 4288455
10000000586 330 4288455 2022-09-12 2022-09-12 00:00:00.000000 6.413970727863393 4288455
10000000587 136 4288455 2019-02-02 2019-02-02 00:00:00.000000 4.691778998076871 4288455
10000000588 17 4288455 2020-08-18 2020-08-18 00:00:00.000000 14.77684965289811 4288455
10000000589 55 4288455 2019-11-23 2019-11-23 00:00:00.000000 79.61828150487227 4288455
... ... ... ... ... ... ...
10000001165 279 40757663 2019-11-10 2019-11-10 00:00:00.000000 Cancer 40757663
10000001166 494 40757663 2020-08-28 2020-08-28 00:00:00.000000 Cancer 40757663
10000001167 282 40757663 2020-02-03 2020-02-03 00:00:00.000000 Cancer 40757663
10000001168 284 40757663 2020-11-21 2020-11-21 00:00:00.000000 Cancer 40757663
10000001169 284 40757663 2019-09-06 2019-09-06 00:00:00.000000 Cancer 40757663

585 rows × 6 columns