Remote Web Development Job In IT And Programming

Web scraping and automation

Find more Web Development remote jobs posted recently Worldwide

Web Scraping + Automation + Excel
Looking for an experienced Web Scraping, Data Mining and Automation specialist. This is a very well defined and streamlined task that involves triggering multiple scripts form within excel, with each script performing a web scraping operation and returning the data/findings to excel.

The job involves web scraping 4 websites for different pieces of data and integration with excel.
Each step below needs to be run from within Excel, executed as an excel macro maybe?

Inputs:
Websites A, B, C, D
Dates from(D1) and to(D2)

Step (1)
- Go to website A
- Bypass captcha and move to next/search page
- Perform a search between D1 and D2, and selecting an additional choice from a multiple choice item.
- Results will be displayed as a table.
- For each row, if one of the fields equals X, press on a link to open the details
- Extract some information from the page, say into columns C1, C2, C3 ... C10
- We need a pdf from that same page, download pdf and upload to Google Drive (credentials will be provided).
- Populate excel sheet with C1 (with hyperlink to pdf), C2, C3, ... C10

Step (1) BONUS - If you are able to do this, Im willing to pay a higher price.
Extract two fields from pdf - by performing OCR - and store as C11 and C12

Step (2)
- Go to website B
- For each row obtained in Step (1), use two columns - say C3 and C4 - to search for information on website B.
- Extract info found on page into some columns C11 ... C15
- Make C11 a hyperlink to the result page, if possible.
- If not possible, get a pdf of the page obtained and upload to Google Drive, then make C11 a hyperlink to that file. (Printable version of the page is available on website B after performing the search).

Step (3)
- Go to website C
- For each row obtained in Step (1), use two columns - say C12 and C13 - to search for information on website C.
- Extract info found on page into some columns C16 ... C20
- Make C16 a hyperlink to a public website providing input of C16 value.

Step (4)
- For each row obtained in Step (1), use column C20 as input to perform an API call to a publicly available service (simple API call).
- Store result in column C21

Step (5)
- For each row, perform a simple mathematical operation between a couple of columns in that row and store as an additional column, C22

Step (6)
- Filter sheet based on C22 value.

Step (7)
- Go to website D, enter credentials (provided).
- For each row, perform a search on the website, get information and store in excel sheet C23 ... CN

Done

Total of 7 buttons/macros in excel sheet ... Each step performs a function. You will need to sign an NDA.
About the recuiter
Member since Nov 11, 2022
Jeganath Maruthai
from Dalaba, Guinea

Skills & Expertise Required

software development Website Development 

Open for hiringApply before - Oct 17, 2024

Work from Anywhere

40 hrs / week

Fixed Type

Remote Job

$95.52

Cost

Offer to work on this project closes in 86 days!
Are you interested in this Opportunity?

Looking for help? Checkout our video tutorial
How to search and apply for jobs

How to apply? Do you have more questions about the Job?
See frequently asked questions

Similar Projects

Help manually remove metasploit payload from iPhone

Are you a penetration tester or kali linux/metasploit user?

We will have a meeting on skype to help walk me through how to remove metasploit payload on my hacked device. The deliverable is to help me manually remove metasploit payload(s) fro...read more

Seagull Scientific Bartender- Powershell Scripting support

Dear Applicants,

We require scripting support to resolve errors on power shell script errors using Bartender
Want to arrange remote log in to run through errors and clear up errors or create a better solution

Powershelll script...read more

Linux System Administrator with good experience in Open SSL

I need a system admin to handle handshake error due to difference in cipher suite.
I will provide a complete details of The TLSv1.2 cipher suite on my server when job is awarded.

AWS Maintenance Services

AWS Maintenance services,
1. Monthly service report
2. Qtrly service report
3. Privileged Access in AWS
4. Provide SOC 2 reports
5. User roles
6. Pass role permissions
7. AWS Service Custom Permissions

Be the lead on a mariadb and Galera project

Looking for an experienced professional with extensive mariadb and Galera cluster background to help optimize and update a current platform.