Remote Data Mining And Management Job In Data Science And Analytics

DATA INTEGRATION AND AI MODELING

Find more Data Mining And Management remote jobs posted recently Worldwide

1. Project Background and Description:

Access, real time satellite data, surface measurement and model data to map PM2.5 over America.

2. Data Sets:
a. GOES-R data (every hour) from Google Clouds (netcdf format)
b. Real time surface PM2.5 measurements (OPENAQ.ORG, API access)
c. NASA GEOS-5 data (every hour, netcdf format)

3. Geographical Scope: Americas (North & South)

4. Tasks:

4.0 Develop a python code to download PM2.5 data from openaq.org every hour. This data will be saved into a database (postgres). We will provide an existing code that can be modified to perform these task.

4.1 Develop a python code to download, read and extract GOES-R data. This code will run every hour to download new data, it will read data over ground locations and save certain parameters in the database. There will be multiple files each hour to read and extract the data from.

4.2 Develop/Modify a python code to download, read GEOS-5 data. This code will read several parameters for the same locations as in 4.1. The data are in netcdf format. We will provide an existing code that can be modified to perform these task. Again, extracted data will be saved in the database along with data extracted from GOES-R.

4.3 The combined data set should be collocated in space and time i.e. for every date/time stamp there will be several parameters reported in the database from three difference sources. The initial data will be processed on Microsoft Azure. The database will created for the first six month of year 2018. The database will be divided into training, testing and validation.

4.4 Develop machine learning/AI algorithm to estimate PM2.5 (openaq, 4.0) based on inputs from 4.1 & 4.2. This is one of the most important task and will require some research. This task will use database collected in task 4.3. The AI algorithm will be created using AI tools available through Microsoft Azure at Azure cloud system. The AI algorithm must be trained well, optimize to perform well with independent data sets. 10-fold validation using validation data sets must be performed and presented in graphical forms with table and numbers with error estimations. The AI model will be developed separately for Northern America and Southern America. Each will be evaluated separately. The AI development work is important and must be satisfactory to be accepted for the submission.

4.5 Implement the developed AI model with inputs from two data sets (GOES-R, GEOS-5) and create maps of estimated PM2.5 for the larger region. This code should run in real-time every hour using cronjob by using codes from 4.0, 4.1, 4.2. Also, extract the data to be entered into database.

4.6 Develop a document with details on code, AI models, error estimates and other details. This document should be detailed enough for someone else to duplicate the work created here.

5. Deliverables:

1. Source codes - all including AI models.
2. All codes must work in production environment in microsoft cloud
3. AI models in form of python codes and original formats.
4. Graphical results and explanation of AI model performance during training, testing and
validation
5. Integrated data sets in csv format (openaq, goes-r, geos-5) for six months.
6. Demonstration of successful running of all codes and AI models on Microsoft Azure cloud.
7. Report documenting all the details of the codes and AI models

Skills Required:

Pythons, Django, AI/Machine Learning on Microsoft Azure, Azure cloud, data analysis and good understanding of statistics.

The project will be divided into 3 mile stones ($100, $300, $100) and will be decided after discussion with freelancer and may not be in equal amount.
About the recuiter
Member since Mar 14, 2020
Krishna Kiran
from Vojvodina, Yugoslavia

Skills & Expertise Required

SQL Azure Microsoft Windows Azure 

Open for hiringApply before - Jan 23, 2025

Work from Anywhere

40 hrs / week

Fixed Type

Remote Job

$477.90

Cost

Offer to work on this project closes in 185 days!
Are you interested in this Opportunity?

Looking for help? Checkout our video tutorial
How to search and apply for jobs

How to apply? Do you have more questions about the Job?
See frequently asked questions

Similar Projects

Azure, AWS, and GCP Tutorials Wanted

Were looking for someone to create a few cloud tutorial videos for Azure, AWS, and Google Cloud, that we can use on our YouTube channel.

The tutorials should be engaging and visually appealing, and must include as many practical demonstrati...read more

Certified azure and windows server admin

Looking for a certified azure admin with in depth knowledge of azure VMs , app services, SQL server for part time work in EST hours. Prefer real time communication using skype or slack . This is not a full time position but we need a reliable resour...read more

Facebook Instant article help or approval service

I need my blogspots blog to connect to instant articles. I need help in that problem or I need approval service for that. Waiting for your proposals. I will pay you 40$ for approving.

Deploy on Azure .NET an app (.NET Core + Angular) as Server Side Rendering (SSR).

I have an app on .NET Core 2.1.1 and Angular 7.
I implemented universal angular as explained

1. ng add @nguniversal/express-engine --clientProject angular.io-example
2. npm run build:ssr
3. npm run serve:ssr

It is worki...read more

Programmer needed to export email contacts from office 365

I need a good computer programmer to export email contacts from office 365 directory default global address list.