Remote Data Mining And Management Job In Data Science And Analytics

CSV File Manipulation

We are looking for someone who can help us with CSV data manipulation and some data scraping.

We are rebuilding an ecommerce store and need someone with strong Excel, CSV, data manipulation and data entry skills to complete a project for us.

We need someone to be able to manipulate one line of data to populate a new CSV file format.

The old file and the template will be provided on request.

We will also require you to go into the back end of the current live website and save all the images, naming them so that they match the CSV and can be uploaded to the new website.


Each product will be partially expanded depending on which size and colour variations are available. For Y1001H, the first row indicates that the product in size 12 is only available in B,CD cup sizes and only in the colour Beige. Commas are used to show that the product has multiple values for an attribute. The Y1001H product is therefore represented by rows 2 to 8. I have colour coded the original file and the expanded file to make it easier to understand how the data expansion works.

Example: if you have 3 colours, 2 bra sizes and 4 bra cup sizes, then your total number of rows will be 3 x 2 x 4 = 24.

This applies not just to bras but to other products too, so if we have a product with 6 sizes and 5 colours, it will be expanded to 6 x 5 = 30 rows.
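
The row arithmetic above can be sketched in Python as a Cartesian product over the comma-separated cells. This is only an illustration under assumptions: the field names (`colour`, `size`, `cup`, `style_code`, `name`) and the sample values are hypothetical and do not come from the real template, which will be provided on request.

```python
from itertools import product
from math import prod


def split_values(cell):
    """Split a comma-separated cell into trimmed, non-empty values."""
    return [value.strip() for value in cell.split(",") if value.strip()]


def expected_row_count(row, varying):
    """Multiply the number of options in each varying column."""
    return prod(len(split_values(row[col]) or [""]) for col in varying)


def expand_row(row, varying):
    """Return one copy of `row` per combination of the varying columns.

    Columns not listed in `varying` are duplicated unchanged.
    """
    options = [split_values(row[col]) or [""] for col in varying]
    expanded = []
    for combo in product(*options):
        new_row = dict(row)                  # copy the unchanged columns
        new_row.update(zip(varying, combo))  # overwrite the varying ones
        expanded.append(new_row)
    return expanded


# Hypothetical input row: 3 colours, 2 sizes, 4 cup sizes.
row = {
    "style_code": "X0001",
    "name": "Example bra",
    "colour": "Black, White, Beige",
    "size": "32, 34",
    "cup": "A, B, C, D",
}
varying = ["colour", "size", "cup"]
rows = expand_row(row, varying)
print(expected_row_count(row, varying), len(rows))
```

Each expanded row holds a single value per attribute, so the result can be written out with `csv.DictWriter` once the real column headers are known.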

The only columns that will change when expanding are H, I, J, K, R and S; the rest will stay the same, so you can duplicate the data.

When expanding the data, the first entry is considered the parent product, so we use the style code (column G value) as the SKU; the following child products are then given an increment starting with -1. We do this because the SKU needs to be unique. The product code is unique, but each product code has different colour and size options, so we need to distinguish each variation.
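
One possible reading of the SKU rule, sketched in Python: the first expanded row keeps the bare style code and every later row gets a numeric suffix starting at -1. The function name `assign_skus` is hypothetical, and whether the parent counts as one of the variation rows should be confirmed against the example file.

```python
def assign_skus(style_code, row_count):
    """Return SKUs for one product: parent first, then -1, -2, ... children."""
    if row_count < 1:
        return []
    children = [f"{style_code}-{i}" for i in range(1, row_count)]
    return [style_code] + children


print(assign_skus("Y1001H", 4))
```

Because every suffix is different, the SKUs stay unique within a product as long as the style codes themselves are unique.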

Expanding the data is the first step; the next step is saving images from an existing website. To do this, you will be given access to a website where you locate each product using the style code (column G) and save the image for each colour variation of that product against the SKU generated in the CSV. Example: for the product Y1001H you will find a picture matching the colour Beige. You then need to save this image under a name made by merging the style code and colour, and place that value in the CSV (column W). Columns G and K should be merged with an underscore separating the two values, followed by the image extension (JPG, JPEG or PNG). You may or may not get the same extension for every image. I have only done one example of this in the expanded file attached. You may come across the same colour photographed from different angles; in that case, simply add a number at the end of each name and separate the names with commas (Y1001H_Beige_1.jpg, Y1001H_Beige_2.jpg, Y1001H_Beige_3.jpg etc...).
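
The naming rule for column W could be sketched as below. The helper name `image_cell` is hypothetical, and lower-casing the extension is an assumption based on the `.jpg` examples above; the extensions passed in should be those of the files actually saved.

```python
def image_cell(style_code, colour, extensions):
    """Build the column W value for one style code and colour.

    `extensions` holds one entry per saved image, e.g. ["jpg"] or
    ["jpg", "png"]. A single image gets no number; several images are
    numbered from 1 and joined with commas.
    """
    # Assumption: normalise ".JPG" / "JPG" to "jpg".
    cleaned = [ext.lstrip(".").lower() for ext in extensions]
    base = f"{style_code}_{colour}"
    if len(cleaned) == 1:
        return f"{base}.{cleaned[0]}"
    return ", ".join(f"{base}_{n}.{ext}" for n, ext in enumerate(cleaned, start=1))


print(image_cell("Y1001H", "Beige", ["jpg"]))
print(image_cell("Y1001H", "Beige", ["jpg", "jpg", "jpg"]))
```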

Product type (column V): not all products are variable. Most products in the list are, but a few are not. This means they have only one option, with no multiple colours or sizes. This is rare but possible, and you can leave such a product as it is because it will take up only one row. Just make sure column V is set to simple so the importer knows it's a simple product.
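
A rough way to flag simple products after expansion, assuming a product is simple exactly when its style code appears on a single row. The keys `style_code` and `product_type` are hypothetical stand-ins for columns G and V; the value written for variable products is not specified here, so those rows are left untouched.

```python
from collections import Counter


def mark_simple_products(rows, style_key="style_code", type_key="product_type"):
    """Set the product type to "simple" for style codes that occupy one row."""
    counts = Counter(row[style_key] for row in rows)
    for row in rows:
        if counts[row[style_key]] == 1:
            row[type_key] = "simple"
    return rows


rows_v = [
    {"style_code": "Y1001H", "product_type": "variable"},
    {"style_code": "Y1001H", "product_type": "variable"},
    {"style_code": "Z0001", "product_type": ""},
]
mark_simple_products(rows_v)
print([row["product_type"] for row in rows_v])
```
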
About the recruiter
Member since May 20, 2018
Jitender Kapila
from Marijampoles, Lithuania

Skills & Expertise Required

Data Science & Analytics Data Mining & Management 

Open for hiring

Apply before - Aug 1, 2024

Work from Anywhere

40 hrs / week

Hourly Type

Remote Job

$17.25

Cost

Offer to work on this project closes in 28 days!

Similar Projects

TOWS map updates

Someone with the ability to take data and update mandated zoning and future land use maps. Talent that can transcribe the hard data into a GIS format to reflect all recent annexations, rezonings and future land use updates.

Financial Time Series Forecasting Project

Time Series Forecasting.
Forex Currency Pair EUR/GBP

I will provide you with approximately 20 years of 1 millisecond Data.
You take this data and perform Deep Learning methods on the data.
Perhaps using an RNN, LSTM.
Using pe...

Requirement for a Data Analyst for an e-commerce company

Looking for a candidate who can fulfill the following requirements:

Mandatory experience with an e-commerce company; should be able to crawl huge amounts of data with Python and have a data mining background.

*Experience in data models...