Remote Data Mining And Management Job In Data Science And Analytics

Regular Expressions (RegEx) for extracting data from unstructured text documents


I have a repository of text files produced by different authors. Each document contains a set of discrete data points that I wish to extract. The authors use different templates and formatting, but every document attempts to provide values for the same master set of data points.

My team has developed a framework that uses regular expressions to automatically process our different document templates. The goal of this contract is to engage developers to review sample instances of an assigned document template and then write the regex statements that extract the necessary values accurately and consistently.
* Applicants will be provided with samples of the document template being assigned.
* Applicants will be asked to submit at least 3 extraction rules to demonstrate an understanding of the parser framework and skill in producing robust, accurate matching rules.
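
As a rough illustration of what per-template extraction rules might look like, here is a minimal Python sketch. It is not the team's parser framework, which is not described in this posting. The template names (`template_a`, `template_b`), the field names (`report_date`, `total`), and the sample text are all hypothetical.

```python
import re

# Hypothetical rule sets: one mapping of field name -> compiled pattern per
# document template. Every pattern captures its result in a group named "value".
RULES = {
    "template_a": {
        # Anchored to a whole line so stray mentions of "Date:" mid-sentence do not match.
        "report_date": re.compile(
            r"^Date:[ \t]*(?P<value>\d{4}-\d{2}-\d{2})[ \t]*$", re.MULTILINE
        ),
        "total": re.compile(
            r"^Total[ \t]*:[ \t]*\$?(?P<value>[\d,]+\.\d{2})[ \t]*$", re.MULTILINE
        ),
    },
    "template_b": {
        "report_date": re.compile(r"Reported on\s+(?P<value>\d{2}/\d{2}/\d{4})"),
        "total": re.compile(
            r"Amount due\s*[-:]\s*(?P<value>[\d,]+\.\d{2})", re.IGNORECASE
        ),
    },
}


def extract(template, text):
    """Apply every rule for the given template; missing fields map to None."""
    result = {}
    for field, pattern in RULES[template].items():
        match = pattern.search(text)
        result[field] = match.group("value") if match else None
    return result


if __name__ == "__main__":
    doc_a = "Date: 2024-09-11\nTotal: $1,234.50\n"
    doc_b = "Reported on 11/09/2024. AMOUNT DUE - 99.00"
    print(extract("template_a", doc_a))
    print(extract("template_b", doc_b))
```

Returning `None` for a field that does not match, rather than raising, makes it easy to measure how consistently a rule fires across a batch of sample documents.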

Our team is committed to providing timely feedback and support in order to ensure a successful contract completion.
About the recruiter
Member since Nov 11, 2022
Kumari Smita
from Stredochesky, Czech Republic

Skills & Expertise Required

Data Science & Analytics, Data Mining & Management

Open for hiring

Apply before: Sep 11, 2024

Work from Anywhere

40 hrs / week

Hourly Type

Remote Job

$34.30

Cost

Offer to work on this project closes in 30 days!

Similar Projects

Expert developer needed to convert exported mvnforum data to bbPress and configure bbPress forums

We have existing forums running on the Java-based mvnforum, hosted by a vendor, with <5000 posts and <200 users. We want to migrate to bbPress, a WordPress-based solution.

We are not able to connect directly to the vendor's servers, but they are...

GDPR / Data Consultant

We're a social media agency based in London.

We're looking for a consultant to advise us and assist with data capture.

We're looking to understand GDPR compliance around email capture, device ID capture, retargeting captur...

Use any appropriate scripting language to check a URL's Alexa ranking and site classification

Use an appropriate scripting language and keep it simple.
The URL will be provided.
Check the website's Alexa ranking (or equivalent) and site classification.
Insert the records into a MySQL database.