I need to extract data about companies from 4 websites. Websites are catalogs of franchise companies. Total number of records(companies) on all sites -2-4k
The data has good robust markup on websites so It can be done by using special software tools (plus some manual editing in unusual cases)
I will share a list of site with shortlisted candidates
Extracted data structure same for all websites there are just different languages
Output data need to be stored in Google Spreadsheet
The output data structure for each company look like this(example):
Name: Batteries Plus Bulbs
Category: Food
Is in rank?: 114
Founded year: 1988
Franchising Since: 1992
Initial Investment(low, USD): 190144
Initial Investment(high, USD): 367358
UNITS (2018, US): 666
UNITS (2018, Outside): 0
UNITS (2018, Company): 66
About the recuiterMember since Mar 14, 2020 Narendra Ganesh
from Quebec, Canada