University Data Science 2023 Semester project Analytics & NLP

Do Elon Musk's Tweets Move Tesla's Stock?

Built a decade-long data pipeline with LLM sentiment analysis to test a popular investing myth.

For the capstone project in my Data Science course, I worked with a team of three to analyse whether Elon Musk's tweets correlate with Tesla's daily closing price. The hype around social media influence made for a compelling problem, but we wanted to validate it with rigorous methodology and transparent communication of limitations.

Key result: Found no statistically significant correlation, showing that day-to-day market forces outweigh long-form social media noise.

Building the Data Pipeline

Sentiment Analysis with a Human-in-the-Loop

We used a Llama 3 language model to score tweet sentiment. To counter sarcasm and emoji ambiguity, we integrated a human-in-the-loop review process. Samples were manually checked, and the team documented model failure patterns so future iterations could improve accuracy.

Regression & Interpretation

Linear regression models tested correlations between sentiment scores and stock price changes. Results were statistically insignificant, indicating other macro factors dominate price movements at the daily level. We explored shorter windows, but the noise floor remained too high without intraday data.

Skills Demonstrated

Back to article summaries