Remote Data Mining And Management Job In Data Science And Analytics

Large-Scale Topic Model / Sparse Matrix Construction

Find more Data Mining And Management remote jobs posted recently Worldwide

I have a larger corpus (-38K documents) that I am attempting to run topic models over. The problem right now is that creating a corpus via packaged solutions, like Textacy, puts too much strain on the memory for the doc-term-matrix. I believe there are more efficient solutions, perhaps collapsing each document into a Counter object and incrementally feeding it into a sparse matrix, but I am looking for someone with more experience working with larger, data-intensive memory sets and scientific computing to create code that can easily be transported over. I can provide the full text files as a corpus, and am available to discuss the project on an ongoing basis.
About the recuiter
Member since Mar 14, 2020
Himanshu Shekha
from Nunavut, Canada

Open for hiringApply before - Aug 10, 2024

Work from Anywhere

40 hrs / week

Hourly Type

Remote Job

$26.83

Cost

Offer to work on this project closes in 37 days!
Are you interested in this Opportunity?

Looking for help? Checkout our video tutorial
How to search and apply for jobs

How to apply? Do you have more questions about the Job?
See frequently asked questions

Similar Projects

Solving a fractional differential equation using a neural network.

I want to solve a fractional differential equation. I already make neural network in TensorFlow but it has some bugs I couldnt fix it. May be you can do it.

Looking for a Full stack developer to help add new features to existing web application.

We are looking for a full stack developer to add new features to an existing web application. Must have skills include 10+ experience with HTML, Python, and Java Script.

Preference given to candidates who have expertise with mongoDB, Atlas,...read more

Implement pairs trading algorithm using Random Forests

Implement pairs trading algorithm using Random Forests.
No overfitting.

Acceptance test: Metrics for 2000-2017:
Sharpe Ratio: 2
Performance: 30%

Preferred platform: Quantconnect.

I need data scraped from various website and to document that in a particular mentioned format

I need data scraped from a 2 websites like amazon and Flipkart and store all the data in a particular format. All the product details to be scrapped and the format to save it in csv will be communicated to you.
Right now looking for a freelancer...read more

Natural Language Processing AI Project

I need an AI voice developed for scripted outbound calls. I will provide scripting, database, recordings for tonality, etc.