Remote Data Mining And Management Job In Data Science And Analytics

Conversion of PDF bank statements into CSV

We're looking for a data extraction expert to extract transaction details from PDF bank statements into a CSV. There are 44 PDF statements. The fields to be extracted into the CSV:

Date -> Date of the transaction
Description -> Transaction description
Original Description -> Transaction description (same as Description)
Amount -> Transaction amount, as a positive number
Transaction Type -> credit if Amount is positive, debit if Amount is negative
Category -> Blank
Account -> Blank
Name -> Blank
Labels -> Blank
Notes -> Blank
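
As a rough illustration of the field mapping above, here is a minimal Python sketch. It assumes each transaction has already been parsed from the PDF into a date, a description, and a signed amount (negative for money out); the names `FIELDS`, `to_row`, and `write_csv` are hypothetical, and treating a zero amount as a credit is an assumption, since the posting does not cover that case.

```python
import csv
import io
from decimal import Decimal

# Column order taken from the field list in the posting.
FIELDS = ["Date", "Description", "Original Description", "Amount",
          "Transaction Type", "Category", "Account", "Name", "Labels", "Notes"]


def to_row(date, description, signed_amount):
    """Map one parsed transaction to the target CSV columns.

    Assumption: signed_amount is the amount as read from the statement,
    negative for debits and positive for credits.
    """
    amount = Decimal(signed_amount)
    return {
        "Date": date,
        "Description": description,
        "Original Description": description,  # same as Description
        "Amount": str(abs(amount)),           # always written as a positive number
        # Assumption: zero is treated as a credit.
        "Transaction Type": "debit" if amount < 0 else "credit",
        "Category": "",
        "Account": "",
        "Name": "",
        "Labels": "",
        "Notes": "",
    }


def write_csv(rows, out):
    """Write mapped rows, with a header, to any file-like object."""
    writer = csv.DictWriter(out, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)


if __name__ == "__main__":
    # Made-up sample transactions, for illustration only.
    sample = [
        to_row("2024-01-05", "Card payment", "-12.50"),
        to_row("2024-01-06", "Salary", "1500.00"),
    ]
    buf = io.StringIO()
    write_csv(sample, buf)
    print(buf.getvalue())
```

Using `Decimal` rather than `float` keeps the amounts exact, which matters when totals are later compared against the statements.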

Deliverables:
- All transactions from the statements extracted into a consolidated CSV file
- Proof that the CSV reconciles with the statements (e.g. a separate Excel file with the transaction count, total debits and total credits from each statement matching the totals from the CSV)
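
One possible way to produce the reconciliation figures is sketched below in Python. It assumes the extracted rows are grouped per statement and that the expected count and totals have been read by hand from each statement's summary; `summarize`, `reconcile`, and the statement name used in the example are hypothetical.

```python
from decimal import Decimal


def summarize(rows):
    """Return transaction count, total debits and total credits for a list of rows."""
    debits = sum((Decimal(r["Amount"]) for r in rows
                  if r["Transaction Type"] == "debit"), Decimal("0"))
    credits = sum((Decimal(r["Amount"]) for r in rows
                   if r["Transaction Type"] == "credit"), Decimal("0"))
    return {"count": len(rows), "debits": debits, "credits": credits}


def reconcile(extracted, expected):
    """Compare per-statement summaries of the CSV rows against expected totals.

    extracted: dict mapping statement name -> list of CSV rows (dicts)
    expected:  dict mapping statement name -> {"count", "debits", "credits"}
               (assumed to be taken from each statement's own summary)
    Returns a list of (statement, field, expected, actual) mismatches.
    """
    mismatches = []
    for name, exp in expected.items():
        actual = summarize(extracted.get(name, []))
        for field in ("count", "debits", "credits"):
            if actual[field] != exp[field]:
                mismatches.append((name, field, exp[field], actual[field]))
    return mismatches


if __name__ == "__main__":
    # Made-up data, for illustration only.
    extracted = {
        "statement_01.pdf": [
            {"Amount": "12.50", "Transaction Type": "debit"},
            {"Amount": "1500.00", "Transaction Type": "credit"},
        ]
    }
    expected = {
        "statement_01.pdf": {"count": 2,
                             "debits": Decimal("12.50"),
                             "credits": Decimal("1500.00")}
    }
    print(reconcile(extracted, expected))  # an empty list means everything matches
```

The per-statement summaries could then be copied into the Excel file requested as proof of reconciliation.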

Requirements for the project:
- Experience with Microsoft Excel
- Excellent grammar and a high attention to detail
- Experience with a scripting language to parse the statements (Python, Bash, Perl, PowerShell, etc.)
- Experience with a PDF parsing tool like Tabula is useful

In your proposal, please share a brief summary of your experience. Attached are two statements from the two accounts.
About the recruiter
Member since Nov 11, 2022
Jhanahan Kumarasamy
from Piemonte, Italy

Open for hiring

Apply before - Oct 27, 2024

Work from Anywhere

40 hrs / week

Fixed Type

Remote Job

$47.81

Cost

Offer to work on this project closes in 30 days!

Similar Projects

Need to parse a JSON file into CSV

The deliverable will be a CSV file that contains all of the fields stored in the JSON, as well as a Python or NodeJS script that executes the conversion.

- No column in the CSV should have JSON data. That means the JSON must be fully unrolled...read more

Biomedical Named Entity Recognition Framework

Required: a simple biomedical named entity recognition project that tags gene/protein terms. The system should take a sentence as input. In addition, character and pre-trained word embeddings should be used for feature learning. At last, Bidirectional L...read more

Automation for Facebook Groups & Messenger

There are many competitor Facebook groups for travel in every tourist city (4 groups for London, 3 for New York, etc.).
I want to monitor my competitors' groups and send each new member (monitoring the new members list) a message via Messenger to jo...read more

Webscrape Korean Trade Data (Data processing and mining)

This project requires an outstanding understanding of web-scraping techniques and tools. A candidate with some knowledge of Korean is preferred (not required).

This should be a fairly quick project. I am happy to discuss more details and com...read more

Reverse IP lookup of an IP address and .csv file of all the domains hosted (250k)

I am looking for someone who can do a reverse IP lookup of an IP address that has around 250,000+ domains.

I then want a .csv of all those main and sub domain addresses, the list must be up to date.