Remote Data Mining And Management Job In Data Science And Analytics

Regression analysis to improve soccer stat

Find more Data Mining And Management remote jobs posted recently Worldwide

I have a list of soccer games and 3 statistics. From these stats Im trying to come up with the most accurate number of expected goals scored for both teams.

The three stats are shots, TIL, and HW.

To come up with an average shots/goal rate for each team we simply take the total home shots and divide by the total home goals, and total away shots divided by total away goals. I already did this in the table in Sheet3.
Once we have these two numbers, knowing the amount of shots each team took in a game we can find out the expected goals for each team (shots taken*shot/goal rate).

Now, I want to make this expected goals number more accurate by incorporating two other stats, TIL and HW. I believe these stats can be useful because when filtering for teams with a high TIL and HW, we can see that their shot/goal rate goes down, meaning it takes them less shots to score a goal.

I tried to do this myself with basic linear regression but for some reason it made the expected goals less accurate. Maybe a different form of regression would work better, or something else altogether.

Looking at the spreadsheet I attached, you can see the results of my attempts in columns AL and AM of the Games sheet. Sheet3 is where I ran the regression. Lineups sheet can be avoided.

Most qualified candidate for the best price will be hired in the next 24 hours.
About the recuiter
Member since May 20, 2018
Manish Lal Moha
from Kyonggi-do, South Korea

Open for hiringApply before - Jul 27, 2024

Work from Anywhere

40 hrs / week

Hourly Type

Remote Job

$9.57

Cost

Offer to work on this project closes in 10 days!
Are you interested in this Opportunity?

Looking for help? Checkout our video tutorial
How to search and apply for jobs

How to apply? Do you have more questions about the Job?
See frequently asked questions

Similar Projects

Simple Excel/VBA Auto-fill auto-populate dynamic PDF project

Our existing excel workbook has the following Data:
Sheet 1 = Every Customer information (Name, Address, Phone, email etc)
Sheet 2 = Every Contractor information (Name, Address, Phone, email etc)
Sheet 3 = Every Project Information (type...read more

R Coder needed for financial analysis project. Python skills also a plus.

Goal is for someone proficient in R coding to produce a file based on excel document/template. Python proficiency would also be a plus.

Seeking a robust financial analysis with ability to plug in data sets via APIs for real-time analysis of...read more

Personal/administrative services, including mail merge, for social media marketing startup

We offer a range of social media services such as Facebook ads, videos and Google page ranking.

Were looking for a virtual assistant to complete projects that are anticipated to require approximately 2-5 hours per week. The work will includ...read more

CCTV Behaviour Analysis

All coding must belong to me after the project (all IP will be bought exclusively).

I require analytics of live CCTV footage to determine:
- how much of time an employee is actually at their station working
- who the employee is (ba...read more