Remote Data Mining And Management Job In Data Science And Analytics

Web scraping, data extraction/cleaning with Python

We are a startup looking to obtain relatively clean data from more or less clean sources online.

Your job would be to scrape different websites that include both structured data (tables) and unstructured data (text). Some of the data can be obtained with simple URL requests (wget, requests, urllib), while other websites require running searches, selecting filters, and clicking buttons that depend on JavaScript (for example, using Selenium).
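To illustrate the simpler of the two cases above (a page whose table arrives in the plain HTML response), here is a minimal sketch using only the Python standard library. The names TableParser, parse_table, fetch_html, and SAMPLE_HTML, as well as the User-Agent string, are hypothetical and not part of the posting. The sketch assumes reasonably well-formed markup and does not run JavaScript, so sites that need clicks or filters would still call for a browser-automation tool such as Selenium.

```python
from html.parser import HTMLParser
from urllib.request import Request, urlopen


class TableParser(HTMLParser):
    """Collect the cell text of every <tr> row in an HTML document."""

    def __init__(self):
        super().__init__()
        self.rows = []      # finished rows, each a list of cell strings
        self._row = None    # row currently being built, or None
        self._cell = None   # text fragments of the current cell, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Collapse runs of whitespace so " 9.99 " becomes "9.99".
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


def parse_table(html):
    """Return all table rows found in an HTML string as lists of strings."""
    parser = TableParser()
    parser.feed(html)
    parser.close()
    return parser.rows


def fetch_html(url, timeout=30):
    """Download a page with a plain URL request (no JavaScript is executed)."""
    request = Request(url, headers={"User-Agent": "example-scraper/0.1"})
    with urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


# Hypothetical sample standing in for a downloaded page.
SAMPLE_HTML = """
<table>
  <tr><th>Item</th><th>Price</th></tr>
  <tr><td>Widget</td><td> 9.99 </td></tr>
</table>
"""

rows = parse_table(SAMPLE_HTML)
```

On a real site, parse_table(fetch_html(url)) would replace the sample string, with url being whichever page is agreed for the project.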

We would like the code to collect the data several times a day using a cron job, ideally set up on AWS EC2. Your code should be written in Python.
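As a rough sketch of what a cron-friendly collection step could look like, the example below stamps each scraped value with its collection time and stores it so that repeated runs never overwrite earlier ones. SQLite, the observations table, the store_rows function, and the file paths in the comments are all assumptions made for illustration. The posting lists MongoDB and SQL but does not name a specific database or schedule.

```python
import sqlite3
from datetime import datetime, timezone

# Hypothetical crontab entry that would run a script every six hours:
#   0 */6 * * * /usr/bin/python3 /home/ubuntu/scraper/run.py

# Hypothetical schema: one row per source, item, and collection time.
SCHEMA = """
CREATE TABLE IF NOT EXISTS observations (
    source     TEXT NOT NULL,
    item       TEXT NOT NULL,
    value      TEXT NOT NULL,
    fetched_at TEXT NOT NULL,
    PRIMARY KEY (source, item, fetched_at)
)
"""


def count_rows(conn):
    """Return the number of stored observations."""
    return conn.execute("SELECT COUNT(*) FROM observations").fetchone()[0]


def store_rows(conn, source, rows, fetched_at=None):
    """Insert (item, value) pairs and return how many rows were new.

    Rows that repeat an existing (source, item, fetched_at) key are
    skipped, so re-running the same collection does not duplicate data.
    """
    if fetched_at is None:
        fetched_at = datetime.now(timezone.utc).isoformat(timespec="seconds")
    conn.execute(SCHEMA)
    before = count_rows(conn)
    conn.executemany(
        "INSERT OR IGNORE INTO observations VALUES (?, ?, ?, ?)",
        [(source, item, value, fetched_at) for item, value in rows],
    )
    conn.commit()
    return count_rows(conn) - before
```

A script run by cron could open a file-backed database with sqlite3.connect("scraper.db") (a hypothetical file name) and pass the scraped pairs to store_rows once per run.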

We would start with a one-off project covering a few of the sites we are interested in and, if we are happy with the person, potentially extend it to an ongoing contractor arrangement.

Skills needed:
Python, web scraping, requests, urllib, Selenium, MongoDB, SQL, data extraction, data acquisition, data cleaning, databases, automation, scripting, cron jobs

We are flexible with payments. We can work hourly or on a fixed-price basis, depending on the freelancer's experience and their time and cost estimate. We are hoping to spend less than 1000 dollars on the first 2-3 websites, with the scraper, cron job, and database setup included.
About the recruiter
Member since Mar 14, 2020
Ashok Kumar
from New York, United States

Open for hiring
Apply before: Oct 19, 2024

Work from Anywhere

40 hrs / week

Fixed Type

Remote Job

$955.68

Cost

Offer to work on this project closes in 90 days!

Similar Projects

Collecting information and completing Excel sheet

Looking for somebody to gather data from the Internet and fill in an Excel spreadsheet according to the instructions explained in the Codebook_file.
Good work is patiently collected, as complete as possible, and drawn from reliable sources.
...

Find & list stock prices for Google on Thursday & Friday at NOON in a spreadsheet

It's easy to find stock prices at the open or close. It's harder to find historical stock prices for specific times going back more than a few days. I need three years of data for stock prices at exactly noon Eastern. Specifically Google (GOOG) and S...

Google Sheet and Web Scraping Expert Needed

Hello, I need someone who is an expert with Google Sheets and web scraping to help me create a Google Sheet that will scrape the property information (house address, asking price, property description) and contact information (name, phone number and e...

Backend Developer for Custom LinkedIn Data Parser and Extraction

Create Excel files/CSVs of reports based on company-specific filters that leverage extracted profile data. Certain filters currently don't exist in LinkedIn (e.g., lawyers from a Top 50 law school who are currently unemployed), so we will provide rele...

AWS Website Hosting Architecture & Management

I am looking to move my dedicated servers from HostGator to AWS and would like to have someone design, set up, and manage the new environment. I am interested in finding a setup that is equal to or better than what we currently have. I would like to know estimated server c...