Remote Data Mining And Management Job In Data Science And Analytics

Scraping CMS data

Find more Data Mining And Management remote jobs posted recently Worldwide

Need to scrape CMS data, and deliver scraping script (in Python), as well as as a database (SQLite) with the data

Looking for a person to scrape (using python, ideally requests library) to do the following steps:

1) Scrape CPSC data: a file downloaded per month for all available years (full version, not abridged)

2) Scrape plan cross walk: a file downloaded for every year available

(removed by Toogit admin)

3) Scrape STARS measures: I want the fall Report_Card_Master_Table excel for each year of data (in the zip), then only extract the Measure_data tab from that excel


3) Create SQLite (or open source) database
SQLite database should have the following:
a) One table that contains all the rows from CPSC_enrollment_info files (but add 2 columns, 1 for year and 1 for month depending on entry from file)
b) One table that contains all the rows from CPSC_contract_info files (but add 2 columns, 1 for year and 1 for month depending on entry from file)
c) One table that contains all the rows from plan crosswalk (add 1 column depending on year of the file)
d) One table with the measure_data from the STARS file (add 1 column depending on the year file)

Deliverable:

1) Python script
2) SQLite (or other open source alternative) database with all the tables mentioned above
About the recuiter
Member since Mar 14, 2020
Rudy Hartono
from Trat, Thailand

Skills & Expertise Required

Data Scraping MySQL Programming Python 

Open for hiringApply before - Oct 22, 2024

Work from Anywhere

40 hrs / week

Fixed Type

Remote Job

$95.59

Cost

Offer to work on this project closes in 90 days!
Are you interested in this Opportunity?

Looking for help? Checkout our video tutorial
How to search and apply for jobs

How to apply? Do you have more questions about the Job?
See frequently asked questions

Similar Projects

AI (RASA) Developer

Looking for a strong developer with hands-on experience in Natural Language Frameworks (e.g., Rasa, Spacy, NLTK), Machine Learning, Automatic Speech Recognition, Text-to-Speech.

MUST HAVE - Experience with RASA and RASA-X

Data needed to build a database of all restaurants in the US

I need to build a database of all restaurants in the United States.

The entry for each restaurant needs to include:
Restaurant name
Address
Cuisine
Hours of operation
Phone number
Website (if any)

You can col...read more

Linear Regression tutorial paper with application to sustainability

We need a tutorial paper for teaching undergraduate CS major level students about using linear regression for data analysis (exploratory data analysis preferred). We need a formal tutorial paper, that explains the theory behind a specific type of dat...read more

Integrate eCommerce sales to Point of sale software to auto invoice

the orders which are sold on the eCommerce site should come to the point of sale software and create invoices automatically. Need to capture the details from order email which gets sent out from big commerce hosted site.

Machine Learning - Comparison of Brands with Description

Hello Applicants,

I am looking at building an algorithm that compares Description of the item with the respective brand for a comparative study

I will share some files with shortlisted candidates there are TWO Worksheets. First shee...read more