- If you are a Windows user...
a. Windows batch scripting in cmd prompt
http://steve-jansen.github.io/guides/windows-batch-scripting/part-1-getting-started.html
b. Using bash commands like awk in cmd prompt is possible too!
https://www.logicsupply.com/explore/io-hub/how-to-enable-linux-bash-in-windows-10/
- Algorithms
a. Discussion on tree models:
https://www.analyticsvidhya.com/blog/2016/04/complete-tutorial-tree-based-modeling-scratch-in-python/
- Tricks in Pandas DataFrame operations
a. Operations to combine with df.groupby(), such as .unstack()
https://chrisalbon.com/python/pandas_apply_operations_to_groups.html
b. Use .agg() to find the most frequent value (mode) after df.groupby()
https://stackoverflow.com/questions/15222754/group-by-pandas-dataframe-and-select-most-common-string-factor
c. Binning, e.g. preparing data for a histogram
https://chrisalbon.com/python/pandas_binning_data.html
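
A minimal sketch of the three pandas tricks above (a, b, c), in case the links go stale. The sample DataFrame, its column names (store, item, price), and the bin edges and labels are made up for illustration; they are not taken from the linked pages.

```python
import pandas as pd

# Hypothetical sample data; column names are illustrative only.
df = pd.DataFrame({
    "store": ["A", "A", "A", "B", "B", "B"],
    "item":  ["pen", "pen", "ink", "ink", "ink", "pen"],
    "price": [1.0, 1.5, 4.0, 4.5, 5.0, 1.2],
})

# (a) groupby + unstack: mean price per (store, item),
# with the inner group level (item) pivoted into columns.
mean_price = df.groupby(["store", "item"])["price"].mean().unstack()

# (b) groupby + agg: most frequent item per store.
# Series.mode() can return several values on a tie; .iloc[0] keeps the first.
top_item = df.groupby("store")["item"].agg(lambda s: s.mode().iloc[0])

# (c) binning with pd.cut: assign each price to a labelled interval,
# then count rows per bin (the counts a histogram would plot).
bins = [0, 2, 6]  # assumed edges: (0, 2] and (2, 6]
df["price_bin"] = pd.cut(df["price"], bins=bins, labels=["low", "high"])
bin_counts = df["price_bin"].value_counts().sort_index()

print(mean_price)
print(top_item)
print(bin_counts)
```

Note that .mode() breaks ties by returning all tied values in sorted order, so taking the first one is an arbitrary but deterministic choice.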