I have a repository of text files produced by different authors. In each of these documents, there are a set of discrete data-points that I wish to extract. While the different authors use different templates and formatting to produce their respective documents, each document is attempting to provide values for the same master set of data-points that I wish to extract.
My team has developed a framework that utilizes regular expressions to automatically process our different document templates. The goal of this contract is to engage developers to review sample instances of the assigned document template, and then produce the regex statements to accurately and consistently extract the necessary values.
* Applicants will be provided with samples of the document template being assigned
* Applicants will be asked to submit at least 3 extraction rules to demonstrate an understanding of the parser framework, and also to demonstrate skill in producing robust and accurate matching rules.
Our team is committed to providing timely feedback and support in order to ensure a successful contract completion.
About the recuiterMember since May 20, 2018 Sunil Shirsat
from Sind, Pakistan