Remote Data Mining And Management Job In Data Science And Analytics

Python web Crawler (bot) that build a bilingual corpus

Find more Data Mining And Management remote jobs posted recently Worldwide

We need to build a web crawler (bot) that will traverse the high level domain like .co.uk or com.
The search for bilingual web sites.
Determine the languages of the site.
Scrap and align the text from the site.

There are many python libraries and research papers that talk about that. I think bitextor for example (which extract and align 2 html pages) will take care of the alignment.

We will be waiting for a detailed proposal how the project will be performed and the time frame.
About the recuiter
Member since Mar 14, 2020
Ria Rustanti
from Safaqis, Tunisia

Skills & Expertise Required

Data Scraping Web Crawling Python Data Extraction 

Open for hiringApply before - Oct 12, 2024

Work from Anywhere

40 hrs / week

Fixed Type

Remote Job

$476.30

Cost

Offer to work on this project closes in 60 days!
Are you interested in this Opportunity?

Looking for help? Checkout our video tutorial
How to search and apply for jobs

How to apply? Do you have more questions about the Job?
See frequently asked questions

Similar Projects

Experienced Data Analyst wanted

We are looking for experienced Data Analyst to analyze, extract, transform and visualize data sets to join our team for future client projects.

I am looking for someone who is very efficient at scraping/mining data from a websites

This project involves gathering a large amount of information in a very organized fashion.

Setup dash/plotly in gcp environment

The key deliverable will be to:
-Setup dash/plotly in our GCP environment.
-Setup adequate GIT repository to maintain underlying dashboard scripts
-Create a dashboard pulling data from both google sheets and google bigQuery.
-Creating...read more

Multi Agent Learning and Swarm Intelligence Tutor

I am looking for an algorithms tutor to teach me some content to prepare for my upcoming exam.

Must be familiar with things such as:

1) Multi Agent Learning
2) Swarm Intelligence
3) Artificial Immune System
4) DNA comput...read more

Web scraping expert needed

I have a python script to scrape data from a website. But I dont know why its not working. The script used Requests and BeautifulSoup libraries of python. Please dont waste my or your time if you are not experienced in web scraping or in other men...read more