minyang-chen

I am AI/LLM Enthusiast. Software (Data,ML,Cloud) Engineer, Enterprise Architect and Tech Lead on helping business solving problems.

https://github.com/PavAI-ResearchOntario, Canada

Pinned Repositories

chain-of-thoughts-agent
Chain of Thoughts is a MRKL system - a modular, neuro-symbolic architecture that combines large language models, external knowledge sources and discrete reasoning built on top of LLMs.
Language:Python1 1 00
chatgpt_like_experience_locally
mimic chatgpt like experience locally using latest open source LLM models
Language:TypeScript2 1 00
distributed_train_finetune
Experiement with LLM Distributed Train and Fine-Tuning
Language:Python13
h20_llm
Fine-tuning an LLM model with H2O LLM Studio to generate Cypher statements Avoid depending on external and ever changing APIs for your knowledge graph based chatbot
Language:Jupyter Notebook3 1 01
intuitive_thinker
To enhance the reasoning capabilities of smaller-sized language models, employ a system of thinking that incorporates mental models, structured Chain-of-Thought processes, and thoughtful reflection before responding to user queries.
Language:Python3 1 01
Knowledge_Distillation_Training
employ knowledge distillation to compress their large deep models into lightweight versions (Teacher and Student Model)
Language:Jupyter Notebook2 1 01
LLM_convert_receipt_image-to-json_or_xml
Finetune LLM to convert an invoice or receipt image to receipt XML or JSON object.
Language:Jupyter Notebook37 1 313
single-node-slurm-cluster-docker
fully dockerized single-node slurm cluster with GPU support
Language:Shell2 1 00
tinyllama_colorist
finetune tinyllama to generate color code
Language:Jupyter Notebook5 1 14

minyang-chen's Repositories

minyang-chen/LLM_convert_receipt_image-to-json_or_xml
Finetune LLM to convert an invoice or receipt image to receipt XML or JSON object.
Language:Jupyter Notebook37 1 313
minyang-chen/tinyllama_colorist
finetune tinyllama to generate color code
Language:Jupyter Notebook5 1 14
minyang-chen/intuitive_thinker
To enhance the reasoning capabilities of smaller-sized language models, employ a system of thinking that incorporates mental models, structured Chain-of-Thought processes, and thoughtful reflection before responding to user queries.
Language:Python3 1 01
minyang-chen/chatgpt_like_experience_locally
mimic chatgpt like experience locally using latest open source LLM models
Language:TypeScript2 1 00
minyang-chen/Knowledge_Distillation_Training
employ knowledge distillation to compress their large deep models into lightweight versions (Teacher and Student Model)
Language:Jupyter Notebook2 1 01
minyang-chen/single-node-slurm-cluster-docker
fully dockerized single-node slurm cluster with GPU support
Language:Shell2 1 00
minyang-chen/distributed_train_finetune
Experiement with LLM Distributed Train and Fine-Tuning
Language:Python13
minyang-chen/multi-nodes-slurm-cluster-docker
fully dockerized distributed multi-nodes slurm cluster - ubuntu 20.04
Language:Dockerfile1 1 10
minyang-chen/slurm-job-samples
Slurm Job Samples encapsulate GPU resources
Language:Python1 2 00
minyang-chen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
Language:C++0 0
minyang-chen/autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
Language:Jupyter Notebook0 0
minyang-chen/axolotl
Go ahead and axolotl questions
Language:Python0 0
minyang-chen/discord_bot
BIDARA is a GPT-4 chatbot that was instructed to help scientists and engineers understand, learn from, and emulate the strategies used by living things to create sustainable designs and technologies using the Biomimicry Institute's step-by-step design process.
Language:Python0 0
minyang-chen/gguf-chatbot-ui
An open source ChatGPT UI. (for GGUF models)
Language:TypeScript0 0
minyang-chen/llama2.c
Inference Llama 2 in one file of pure C
Language:Python0 0
minyang-chen/LLaVA
Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
Language:Python0 0
minyang-chen/llm-awq
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Language:Python0 0
minyang-chen/llm_fast_inference_from_HF_via_speculative_decoding
evaluate Speculative Decoding that promising 2-3X speedups of LLM inference by running two models in parallel.
Language:Python1 0
minyang-chen/Local-LLM-Comparison-Colab-UI
Compare the performance of different LLM that can be deployed locally on consumer hardware. Run yourself with Colab WebUI.
Language:Jupyter Notebook0 0
minyang-chen/mctodo
a simple yet colorful CLI app to keep track of my todo list.
Language:Python1 0
minyang-chen/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
minyang-chen/minyang-chen
1 0
minyang-chen/mojo
The Mojo Programming Language
Language:Mojo0 0
minyang-chen/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python0 0
minyang-chen/paligemma-receipt-json-v2
demo usage of paligemma extraction of receipt image to json object
Language:Python
minyang-chen/piper
A fast, local neural text to speech system
Language:C++0 0
minyang-chen/RLHF_example
Reinforcement learning from human feedback (RLHF) Movie Reviews Example
Language:Jupyter Notebook1 0
minyang-chen/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Language:Python0 0
minyang-chen/strictjson
A Strict JSON Framework for LLM Outputs
Language:Jupyter Notebook0 0
minyang-chen/Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Language:Python0 0