hjeffreywang
Data scientist with a background in engineering, data analytics, database management, ChatGPT implementation, and LLM development.
hjeffreywang@gmail.com
Pinned Repositories
Advanced-Data-Science
Assignments of Machine Learning in Python
Advanced-Prediction-of-Multi-stage-continuous-flow-manufacturing-process
Created data regression models to predict 15 unknown variables within 4% error at any plant condition. Denoised 11000 x 115 dataset, leading to 70% improvement in prediction accuracy. Used data engineering to produce features that were relevant to the target variables. Produced confusion matrices to determine relevant features for machine learning. Final prediction horizon of four times dataset timescale to within 5% error
Alpaca-Algotrading--Legacy-
Engineered and backtested strategies using a variety of indicators and custom features. Optimized buy/sell conditions through custom optimization algorithm. Discovered most significant features through machine learning. Created Heroku app to maintain code during downtime. Implemented a portfolio manager that automatically handles, scales, and visualizes financial gains and losses. Current design yields an SQN of 8.9-9.2.
Analysis-and-time-series-forecasting-COVID-South-Korea-
Automatic_Timeline_setup
Bayesian_analysis_workflow
Beginning-Application-Development-with-TensorFlow-and-Keras
Learn to design, develop, train, and deploy TensorFlow and Keras models as real-world applications
Failed_LSTM_network
Last-Minute-Notes-of-Machine-learning-and-Deep-learning
Last Minute Note of Machine learning and Deep learning by Jason Brownlee
Stock_feature_engineering
Created a continuous, homogeneous, and structured 10 GB dataset from self obtained collections of unstructured intraday financial data. Generated features from indicators, statistics, and recent factors. Used multi-disciplined analysis to find feature importance. Attached labels of trends and stop/hold positions for machine learning. Used machine learning to significant features.
hjeffreywang's Repositories
hjeffreywang/Ad-Effectiveness-Report
hjeffreywang/Airflow-PaddleOCR-test
storing script test of paddleocr for airflow docker paddleocr architecture
hjeffreywang/anthropic-cookbook
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
hjeffreywang/anthropic-sdk-python
hjeffreywang/Awesome-Astra-docs
hjeffreywang/awesome-data-centric-ai
Open-Source Software, Tutorials, and Research on Data-Centric AI 🤖
hjeffreywang/Cancer_Detection_CNN
hjeffreywang/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
hjeffreywang/Data-centric-AI-proof-of-concept
hjeffreywang/data-engineering-zoomcamp
Free Data Engineering course!
hjeffreywang/Deeplearning_TweetClassification
hjeffreywang/dolly
hjeffreywang/E7_autoshop
hjeffreywang/Event_Aware_CNN_POC
hjeffreywang/extractnet
A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one package
hjeffreywang/Generative_adversarial_Testing
hjeffreywang/Gradient-Boosted-Approach-to-Game-metrics
hjeffreywang/justsubs
Download subtitles from YouTube as plain text.
hjeffreywang/neuralforecast
Scalable and user friendly neural :brain: forecasting algorithms.
hjeffreywang/nist-crc-2023
NIST Collaborative Research Cycle on Synthetic Data
hjeffreywang/POC-LSTM-sigmoid-labelling
Proof of concept for signal labelling using a Pytorch LSTM nn
hjeffreywang/R_qualityControl_Practice
hjeffreywang/ragas
SOTA metrics for evaluating Retrieval Augmented Generation (RAG) pipelines
hjeffreywang/RepoToText
Turn an entire GitHub Repo into a single organized .txt file to use with LLM's (GPT-4, Claude Opus, Gemini, etc)
hjeffreywang/sd-webui-controlnet
WebUI extension for ControlNet
hjeffreywang/Simplified_Langchain_proofofconcept
Quick Test Overview Demo of a simplified, scalable, but service dependent question-answering utilizing Astra DB and LangChain, enhanced by Vector Search.
hjeffreywang/Stable-Diffusion-workflow
hjeffreywang/Testing-Kubernetes-Airflow-ETL
hjeffreywang/text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
hjeffreywang/Youtube-Video-Summarizer
TESTING From youtube link, to text, and through chatGPT: summarizes key points of a video. This is made for analysis videos, podcasts, etc.