Remote Data Mining And Management Job In Data Science And Analytics

Web scraping, data extraction/cleaning with Python

Find more Data Mining And Management remote jobs posted recently Worldwide

We are a startup looking to obtain relatively clean data from more or less clean sources online.

Your job would be to scrape different websites, that include more structured data (tables) as well as more unstructured data (text). Some of the data can be obtained with simple URL requests (wget, requests, urllib), while other websites you will need to do searches including selecting filters and clicking buttons that require javascript (for example using selenium).

We would like the code to collect the data several times a day using a cron job, ideally set up on AWS EC2. Your code should be written in Python.

We would start with a one-off project for a few of the sites we are interested in and if we are happy with the person, potentially extend to an ongoing contractor arrangement.

Skills needed:
python, web scraping, requests, urllib, selenium, mongoDB, SQL, data extraction, data acquisition, data cleaning, databases, automation, scripting, cron jobs

We are flexible with payments. We can work hourly or on a fixed price basis, depending on the experience and time and cost estimate of the freelancer. We are hoping to spend less than 1000 dollars for the first 2-3 websites, with the scraper, cron job and database setup included.
About the recuiter
Member since Mar 14, 2020
Mr.uma Shankar
from Amazonas, Brazil

Open for hiringApply before - Sep 11, 2024

Work from Anywhere

40 hrs / week

Fixed Type

Remote Job

$958.43

Cost

Offer to work on this project closes in 69 days!
Are you interested in this Opportunity?

Looking for help? Checkout our video tutorial
How to search and apply for jobs

How to apply? Do you have more questions about the Job?
See frequently asked questions

Similar Projects

Data Extraction from Public Database into Daily Report

We are looking for a freelancer to build us an excel spreadsheet that can pull data from Public Database into Daily Report.

We would like the excel spreadsheet to lists the title, opportunity type (solicitation, sources sought, etc.), docum...read more

AWS Expert needed for long term project

We are developing a microservice-based Business Platform running solely on AWS.

To expand our capacities we are looking for an experienced developer to join our team with the main task to support our AWS Infrastructure configuration with fo...read more

Google Platform (Sites, Docs, Scripting, Calendar)

The project involves scripting, intranet, google calendar, google sheets, google sites as a base for a mentoring platform, I will describe remaining details with shortlisted candidates.. Please show some interest and share your work history with prop...read more

Need Zip Code Data for USA (Zip Code of Universities, Malls, Golf Courses, etc)

I want to be able to have zip codes of anything that can be considered a demographic.

For example which Zip Codes are primarly Colleges/Universities
Which Zip Codes have Golf Courses in them.
Which Zip Codes have Malls around themread more

CEO NAMES STREET ADDRESS CITY STATE ZIP - PROSPECTS

Need to capture:
CEO name
Address, city, state, zip
Mr/Ms.

There are 77 addresses needed and 90 CEO names.
need to have Mr/Ms filled for all