Need to scrape CMS data and deliver the scraping script (in Python) as well as a database (SQLite) containing the data.
Looking for a person to scrape the data (using Python, ideally the requests library) by following these steps:
1) Scrape CPSC data: download one file per month for all available years (full version, not abridged)
2) Scrape plan crosswalk: download one file for every available year
3) Scrape STARS measures: for each year of data, take the fall Report_Card_Master_Table Excel file (inside the zip) and extract only the Measure_data tab from it
4) Create SQLite (or other open source) database
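To illustrate the download steps above, here is a minimal sketch of a monthly download loop using requests. It is only a starting point under stated assumptions: the function names, the output file naming, and especially url_template are hypothetical, since the actual CMS download links are not given here and would need to be looked up on the CMS site.

```python
from pathlib import Path


def monthly_periods(start, end):
    """Yield (year, month) pairs from start to end, both inclusive."""
    year, month = start
    while (year, month) <= end:
        yield year, month
        month += 1
        if month > 12:
            year, month = year + 1, 1


def download_file(url, dest, timeout=60):
    """Stream one file to disk; skip it if it was already downloaded."""
    # Third-party dependency, imported here so monthly_periods works without it.
    import requests

    dest = Path(dest)
    if dest.exists():
        return dest
    dest.parent.mkdir(parents=True, exist_ok=True)
    with requests.get(url, stream=True, timeout=timeout) as resp:
        resp.raise_for_status()  # fail loudly on 404 and similar errors
        with open(dest, "wb") as fh:
            for chunk in resp.iter_content(chunk_size=1 << 16):
                fh.write(chunk)
    return dest


def download_monthly(url_template, start, end, out_dir="downloads"):
    """Download one file per month.

    url_template is a hypothetical placeholder such as
    "https://example.invalid/cpsc-{year}-{month:02d}.zip"; the real
    CMS link pattern is an assumption and must be verified.
    """
    paths = []
    for year, month in monthly_periods(start, end):
        url = url_template.format(year=year, month=month)
        dest = Path(out_dir) / f"cpsc_{year}_{month:02d}.zip"
        paths.append(download_file(url, dest))
    return paths
```

Keeping the year and month in the saved file name means the later database step can recover the period for each file without reopening the download logic.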
The SQLite database should contain the following:
a) One table that contains all the rows from the CPSC_enrollment_info files (plus 2 added columns, 1 for year and 1 for month, set according to the file each row came from)
b) One table that contains all the rows from the CPSC_contract_info files (plus 2 added columns, 1 for year and 1 for month, set according to the file each row came from)
c) One table that contains all the rows from the plan crosswalk files (plus 1 added column for the year of the file)
d) One table with the Measure_data rows from the STARS files (plus 1 added column for the year of the file)
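As a rough sketch of how tables a) to d) could be filled, the example below appends the rows of one CSV file to a table and tags each row with the year (and optionally the month) of its source file. It uses only the standard library. The helper name load_csv_rows, the table name, and the sample column names are illustrative assumptions, and the sketch assumes every file loaded into a given table has the same columns and none already named year or month.

```python
import csv
import io
import sqlite3


def load_csv_rows(conn, table, csv_text, year, month=None):
    """Append the rows of one CSV file to `table`, adding period columns."""
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader)
    # Monthly files get year and month; yearly files get year only.
    extra_names = ["year"] if month is None else ["year", "month"]
    extra_values = [year] if month is None else [year, month]
    columns = [f'"{name.strip()}"' for name in header] + extra_names
    conn.execute(f'CREATE TABLE IF NOT EXISTS "{table}" ({", ".join(columns)})')
    placeholders = ", ".join("?" for _ in columns)
    rows = (row + extra_values for row in reader if row)  # skip blank lines
    conn.executemany(f'INSERT INTO "{table}" VALUES ({placeholders})', rows)
    conn.commit()


# Usage with made-up sample data (column names are illustrative only).
conn = sqlite3.connect(":memory:")
sample = "Contract Number,Plan ID,Enrollment\nH0001,001,120\nH0001,002,45\n"
load_csv_rows(conn, "cpsc_enrollment_info", sample, year=2020, month=3)
```

Calling the same helper once per downloaded file, with month left as None for the crosswalk and STARS tables, would accumulate all periods in a single table per file type.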
Deliverable:
1) Python script
2) SQLite (or other open source alternative) database with all the tables mentioned above
About the recruiter: Lionel Gough, from England, United Kingdom. Member since Mar 14, 2020.