Project About me Blog

Blog

Week 5-6:

3/8/26
I was sick for much of this period, however we still have developments as far as exploring how to fit models to the data we have. The solution that I came to was that it will be much easier to model the data in a slightly different form, containing both total points for a player and change in points from the previous period. The process of obtaining this data in the right form continues and will hopefully be solved shortly. This week also brought a change to looking primarily at weekly rankings in order to allow for other factors such as seasonality to be considered.

Week 4:

2/22/26
Other modeling solutions including K nearest neighbors have emerged as possible solutions, though data wrangling has remained a problem. A success from this week came in creating a solid timeline with broken up goals that should make staying on track easier. The next step will come in changing the data form to be modeled correctly.

Week 3:

2/15/26
Running smoothing techniques on this cross-sectional data has proven to be a challenge, as solutions such as summarizing average points by age have proven to be non-intuitive. I have found possible solutions in vector exponential smoothing to alleviate this issue. Right now, the focus is primarily on the data containing end of year rankings rather than weekly, so it will be nice to see what works well and what does not before moving on to more granular data.

Week 2:

2/8/26
After meeting with my advisors I was introduced to possible methods for continued EDA. I also found more data sources that may help in exploring factors such as age and court surface performance affecting player rankings. Techniques were discussed as possibilities to remove this seasonality element to allow for smoothing techniques. Had to do a little bit of wrangling to create the table for top 100 rankings for the entire timeframe, but should now have a much easier time exploring the data.

Weeks 0-1:

2/4/26
This week started with an unfocused idea for a project and ended with a much more clear direction of where we are headed. A big accomplishment from this week was finding the datasets that will drive this project. Additionally, I have been doing some basic exploratory data analysis, including identifying possibilities for functional form and modeling techniques.