Remote Data Mining And Management Job In Data Science And Analytics

Regular Expressions (RegEx) for extracting data from unstructured text documents

I have a repository of text files produced by different authors. Each document contains a set of discrete data points that I wish to extract. Although the authors use different templates and formatting, every document attempts to provide values for the same master set of data points.

My team has developed a framework that utilizes regular expressions to automatically process our different document templates. The goal of this contract is to engage developers to review sample instances of the assigned document template, and then produce the regex statements to accurately and consistently extract the necessary values.
* Applicants will be provided with samples of the document template being assigned
* Applicants will be asked to submit at least 3 extraction rules to demonstrate an understanding of the parser framework, and also to demonstrate skill in producing robust and accurate matching rules.
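
As a rough illustration of what per-template extraction rules might look like, here is a minimal Python sketch. The posting does not describe the team's parser framework, so everything here is an assumption made for the example: the `RULES` table, the `extract` helper, the template names, the field names (`author`, `date`, `total`), and both sample layouts are hypothetical.

```python
import re

# Hypothetical rule table: for each template, one regex per data point.
# Every pattern captures its value in a named group called "value".
RULES = {
    "template_a": {
        "author": re.compile(r"^Author:\s*(?P<value>.+?)\s*$", re.M),
        "date": re.compile(r"^Date:\s*(?P<value>\d{4}-\d{2}-\d{2})\s*$", re.M),
        "total": re.compile(r"^Total:\s*\$?(?P<value>[\d,]+(?:\.\d{2})?)\s*$", re.M),
    },
    "template_b": {
        "author": re.compile(r"Prepared by\s+(?P<value>.+?)\s+on\s"),
        "date": re.compile(r"\bon\s+(?P<value>\d{1,2}\s+[A-Za-z]+\s+\d{4})\b"),
        "total": re.compile(
            r"^TOTAL AMOUNT\W+(?P<value>[\d,]+(?:\.\d{2})?)\s+USD\s*$", re.M
        ),
    },
}

# Invented sample documents, one per hypothetical template.
SAMPLE_A = "Author: Jane Doe\nDate: 2018-05-20\nTotal: $1,234.50\n"
SAMPLE_B = "Prepared by Jane Doe on 20 May 2018\nTOTAL AMOUNT ..... 1234.50 USD\n"


def extract(text, template):
    """Apply every rule for the given template; missing fields map to None."""
    result = {}
    for field, pattern in RULES[template].items():
        match = pattern.search(text)
        result[field] = match.group("value") if match else None
    return result


if __name__ == "__main__":
    print(extract(SAMPLE_A, "template_a"))
    print(extract(SAMPLE_B, "template_b"))
```

Anchoring patterns to line starts and ends (`^`, `$` with `re.M`) and returning `None` for a missing field, rather than raising, is one way to keep rules robust when a document omits a value; a real framework may handle this differently.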

Our team is committed to providing timely feedback and support in order to ensure a successful contract completion.
About the recruiter
Member since May 20, 2018
Andy Hidayat
from Virginia, United States

Skills & Expertise Required

Data Science & Analytics, Data Mining & Management

Open for hiring. Apply before Jul 31, 2024

Location: Work from Anywhere (Remote Job)
Hours: 40 hrs / week
Type: Hourly
Cost: $34.46
