Remote Data Mining And Management Job In Data Science And Analytics

Scrape data


I need to extract data about companies from 4 websites. The websites are catalogs of franchise companies. The total number of records (companies) across all sites is roughly 2,000 to 4,000.

The sites have clean, consistent markup, so the extraction can be done with scraping tools (plus some manual editing in unusual cases).
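As a rough illustration of what extraction from consistent markup could look like, here is a minimal Python sketch using only the standard library. The `data-field` attribute and the sample HTML are hypothetical; the real sites will use their own tags and class names, which have to be checked per site.

```python
from html.parser import HTMLParser


class FieldParser(HTMLParser):
    """Collects the text of elements that carry a data-field attribute.

    The data-field attribute is a hypothetical convention used only for
    this sketch; it is not taken from any of the target websites.
    """

    def __init__(self):
        super().__init__()
        self.fields = {}
        self._current = None  # name of the field currently being read

    def handle_starttag(self, tag, attrs):
        name = dict(attrs).get("data-field")
        if name:
            self._current = name
            self.fields[name] = ""

    def handle_data(self, data):
        if self._current:
            self.fields[self._current] += data

    def handle_endtag(self, tag):
        # Simplification: any closing tag ends the current field,
        # so nested tags inside a field are not supported.
        self._current = None


def extract_fields(html):
    """Return a dict mapping field name to stripped text."""
    parser = FieldParser()
    parser.feed(html)
    parser.close()
    return {name: text.strip() for name, text in parser.fields.items()}


# Hypothetical page fragment for one company.
SAMPLE = (
    '<div><h1 data-field="Name">Batteries Plus Bulbs</h1>'
    '<span data-field="Founded year"> 1988 </span></div>'
)

if __name__ == "__main__":
    print(extract_fields(SAMPLE))
```

Pages that do not follow the expected pattern would come back with missing fields, which is where the manual editing comes in.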

I will share the list of sites with shortlisted candidates.

The extracted data structure is the same for all websites; only the languages differ.

The output data needs to be stored in a Google Spreadsheet.

The output data structure for each company looks like this (example):
Name: Batteries Plus Bulbs
Category: Food
Is in rank?: 114
Founded year: 1988
Franchising Since: 1992
Initial Investment(low, USD): 190144
Initial Investment(high, USD): 367358
UNITS (2018, US): 666
UNITS (2018, Outside): 0
UNITS (2018, Company): 66
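To make the expected structure concrete, here is a small Python sketch that normalizes one record and writes it as CSV, a format Google Sheets can import. The column names are copied from the example above; the function names and the choice of CSV as an intermediate format are assumptions, not a requirement of the job.

```python
import csv
import io

# Column headers copied from the example record in this posting.
COLUMNS = [
    "Name",
    "Category",
    "Is in rank?",
    "Founded year",
    "Franchising Since",
    "Initial Investment(low, USD)",
    "Initial Investment(high, USD)",
    "UNITS (2018, US)",
    "UNITS (2018, Outside)",
    "UNITS (2018, Company)",
]
TEXT_COLUMNS = {"Name", "Category"}


def normalize(raw):
    """Return a row with every column present and numeric fields as int.

    Values that cannot be parsed as integers are kept as text so they
    can be reviewed by hand.
    """
    row = {}
    for col in COLUMNS:
        value = (raw.get(col) or "").strip()
        if col in TEXT_COLUMNS or value == "":
            row[col] = value
            continue
        try:
            row[col] = int(value.replace(",", ""))
        except ValueError:
            row[col] = value  # leave for manual editing
    return row


def to_csv(records):
    """Serialize scraped records to CSV text with a header row."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=COLUMNS)
    writer.writeheader()
    writer.writerows(normalize(r) for r in records)
    return buf.getvalue()


# The example record from this posting, as raw scraped strings.
EXAMPLE = {
    "Name": "Batteries Plus Bulbs",
    "Category": "Food",
    "Is in rank?": "114",
    "Founded year": "1988",
    "Franchising Since": "1992",
    "Initial Investment(low, USD)": "190144",
    "Initial Investment(high, USD)": "367358",
    "UNITS (2018, US)": "666",
    "UNITS (2018, Outside)": "0",
    "UNITS (2018, Company)": "66",
}

if __name__ == "__main__":
    print(to_csv([EXAMPLE]))
```

Keeping unparseable values as text instead of dropping them makes the unusual cases easy to spot in the spreadsheet.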

About the recruiter
Member since Mar 14, 2020
Amod Tiwari
from Ternopil's'ka Oblast', Ukraine

Skills & Expertise Required

Data Science & Analytics, Data Mining & Management

Open for hiring
Apply before - Sep 27, 2024

Work from Anywhere

40 hrs / week

Fixed Type

Remote Job

$14.28

Cost

Offer to work on this project closes in 21 days!
