/ML_finance

Repository containing a project for machine learning in finance university course

Primary LanguageJupyter Notebook

ML_finance

Repository containing a project for machine learning in finance university course

Link to created dataset: https://we.tl/t-pgiAJ7nrhE

Suggested code format: jupyter notebooks

It's supposed to be a python script by the end, but it shouldn't take too much time converting notebooks to a script (hopefully)

Resources

Great guide for time series analysis: https://www.analyticsvidhya.com/blog/2021/10/a-comprehensive-guide-to-time-series-analysis/

About yahoo finance api for stock prices: https://www.analyticsvidhya.com/blog/2021/06/download-financial-dataset-using-yahoo-finance-in-python-a-complete-guide/

Done

9.04 Functional and non-functional requirements document by 14/03/2022 (changed to 11/04/2022)

Person responsible: Kasia Górczyńska

• Gathered and summarised

7.04 Data preparation (by 11/04/2022) (DONE 8.04, sorry for the delay)

Person responsible: Jacek Jankowiak

• Gather general statistics about the provided convictions time-series

• Identify any potential problems with data, filter outliers etc.

• Prepare a master index of stock symbols and retrieve corresponding price data for each symbol (sources, e.g. Yahoo Finance or Google)

9.04 Implement a simple analysis harness that (by 28/03/2022) (changed to 11/04/2022)

Person responsible: Mikołaj Szymczak

• a) splits data into training and test sets,

• b) trains a dummy model,

• c) performs a cross-validation and

• d) generate stats in a form of a pdf report.

11.04 Initial modelling (11/04/2022)

Person responsible: Kasia Górczyńska + Mikołaj Szymczak (+ Jacek Jankowiak)

• Implement a simple regression model to be used as a baseline for further analysis

• Train, test and cross-validate the regression model and show that the model generalizes on the out-of-sample data

Advanced modelling (by 30/05/2022)

Person responsible: Everyone

• Develop, train, test and cross-validate three alternative models

• Compare performance of the models against a baseline model and against each other for the various investment horizons

• Perform stability and sensitivity analysis w.r.t baseline model and investment horizons

TODO

All done!