Remote Data Mining And Management Job In Data Science And Analytics

Data Cleaning with Trifacta and R


I have a continuous flow of data extracted from school websites that needs to be checked and validated before it is made available to an R analytics platform.

I am looking at using a combination of R (we already have quite a lot of code) and Trifacta. The data sets are small, but they need to be joined together very accurately. The data often contains errors or is missing the fields needed to link records across sources. In those cases we either retrieve the required values from previously ingested data or ask the schools for the additional data.
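As a rough illustration of what checking a join "very accurately" could involve, the sketch below uses base R to list records that fail to link before any merge is trusted. The data frames, the `school_id` key, and all column names are hypothetical; the real sources may link on different fields, and the same checks could equally be expressed as Trifacta steps.

```r
# Hypothetical example data: two small extracts that should link on school_id.
enrolment <- data.frame(
  school_id = c("S001", "S002", "S003", NA),
  pupils    = c(420, 310, 275, 198),
  stringsAsFactors = FALSE
)
contacts <- data.frame(
  school_id = c("S001", "S003", "S004"),
  email     = c("a@example.org", "c@example.org", "d@example.org"),
  stringsAsFactors = FALSE
)

# Rows in `enrolment` with a missing key or no partner in `contacts`.
unmatched_enrolment <- enrolment[
  is.na(enrolment$school_id) | !(enrolment$school_id %in% contacts$school_id), ,
  drop = FALSE
]

# Rows in `contacts` with no partner in `enrolment`.
unmatched_contacts <- contacts[
  !(contacts$school_id %in% enrolment$school_id), ,
  drop = FALSE
]

# Keys that occur more than once would multiply rows in a join.
duplicated_keys <- unique(enrolment$school_id[duplicated(enrolment$school_id)])

# Inner join: keeps only the rows that link cleanly.
linked <- merge(enrolment, contacts, by = "school_id")
```

The unmatched rows are the ones that would need to be filled from previously ingested data or referred back to the school.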

The first task in the process is to identify all issues of validity and completeness in each data set, followed by implementing a strategy to fix them.
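A first pass at this task might look like the base R sketch below, which counts missing values per column (completeness) and flags rows that break simple rules (validity). The columns and the two rules are assumptions made for illustration only; the actual rules would depend on each source.

```r
# Hypothetical extract from one source; column names are assumptions.
schools <- data.frame(
  school_id = c("S001", "S002", "", "S004"),
  postcode  = c("AB1 2CD", NA, "EF3 4GH", "IJ5 6KL"),
  pupils    = c(420, -5, 275, NA),
  stringsAsFactors = FALSE
)

# Completeness: a value is blank if it is NA or an empty/whitespace string.
is_blank <- function(x) is.na(x) | trimws(as.character(x)) == ""
missing_counts <- vapply(schools, function(col) sum(is_blank(col)), integer(1))

# Validity: one logical vector per rule, TRUE where the row passes.
rules <- list(
  id_present      = !is_blank(schools$school_id),
  pupils_positive = !is.na(schools$pupils) & schools$pupils > 0
)

# Rows that fail at least one rule are set aside for review.
failed_any <- !Reduce(`&`, rules)
issues <- schools[failed_any, , drop = FALSE]
```

Keeping each rule as a named logical vector makes it straightforward to report which rule a given row failed.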

I am seeking a consultant who is familiar with Trifacta and/or R to build a strategy that targets each data source with a series of analyses that locate the issues in the data that is drawn from that source. In total there could be up to 100 sources for which we need to develop recipes in this cleaning and validation stage.

We want to automate the process as much as possible by adding rules and procedures to each recipe until it contains all the steps required for the data that comes from that specific source.
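One possible way to mirror the per-source recipe idea in plain R is to keep each recipe as an ordered list of small functions and append a new function whenever a new issue is found. This is only a sketch under that assumption: it is not Trifacta's recipe format, and the source name `source_a` and the step functions are hypothetical.

```r
# Each step takes a data frame and returns a cleaned data frame.
trim_ids <- function(df) {
  df$school_id <- trimws(df$school_id)
  df
}
upper_ids <- function(df) {
  df$school_id <- toupper(df$school_id)
  df
}
drop_blank_ids <- function(df) {
  df[!is.na(df$school_id) & df$school_id != "", , drop = FALSE]
}

# One recipe per source, stored by (hypothetical) source name.
recipes <- list(
  source_a = list(trim_ids, drop_blank_ids)
)

# Extending a recipe when a new issue turns up in that source.
recipes$source_a <- c(recipes$source_a, list(upper_ids))

# Apply the steps of a recipe in order.
apply_recipe <- function(df, steps) {
  for (step in steps) df <- step(df)
  df
}

raw_a <- data.frame(
  school_id = c(" s001 ", "", "s003", NA),
  stringsAsFactors = FALSE
)
clean_a <- apply_recipe(raw_a, recipes$source_a)
```

With up to 100 sources, a structure like this keeps every source's steps in one place and lets shared steps be reused across recipes.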
About the recruiter
Member since Mar 14, 2020
Karan Marwah
from Neamt, Romania

Skills & Expertise Required

R 

Open for hiring

Apply before: Aug 28, 2024

Work from Anywhere

40 hrs / week

Hourly Type

Remote Job

$26.85

Cost

Offer to work on this project closes in 59 days!

Similar Projects

Data Science Tutor

I'm looking for a data science tutor to join live calls every day so we can complete a project together, paid by the hour.

The work may take a couple of hours, or possibly more.
I will also be looking for help with R statistical exercises in the future.

Analysis of Brexit Negotiations

I am seeking an analysis of the approach taken to the Brexit negotiations from a negotiation theory perspective. The analysis should utilise secondary research on negotiation theories (integrative) and approaches, including game theory concepts as we...

GGplot2

Hi
I need to make a ggplot2 data visualization (a bar graph) for my studies. Actually, I need any bar graph made with ggplot2 in R that can't be found on the Internet. I will need it with a short presentation of which function does what, so lat...

Need Machine Learning Expert (Good to have Deep Learning Knowledge)

1. Need a machine learning expert with 5-6 years of experience
2. Good knowledge of Python, R and visualization. Should know common Python libraries and R packages, and should also have knowledge of OOP. Should have good data cleaning experie...

Correlation Analysis

I am looking for a capable individual who can correlate survey data with public company stock performance. The survey data has predictive implications, hence I am trying to link the forecast to the actual stock performance or another meas...