Pinned Repositories
clojure_findutils
Clojure implementations of utilities such as find
gluon-tutorials-zh
Hands-on deep learning with MXNet/Gluon
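For flavor, here is the kind of minimal Gluon model the tutorials build up to (a sketch with arbitrary layer sizes, not code from the book):

```python
from mxnet import nd
from mxnet.gluon import nn

# A two-layer MLP defined with Gluon's Sequential container
net = nn.Sequential()
net.add(nn.Dense(128, activation="relu"),
        nn.Dense(10))
net.initialize()

x = nd.random.normal(shape=(4, 784))  # a fake batch of flattened images
print(net(x).shape)                   # (4, 10)
```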
kaggle_cifar-10_mxnet
machine-learning
nanoDPO
A nimble and innovative implementation of the Direct Preference Optimization (DPO) algorithm with Causal Transformer and LSTM models, inspired by the DPO paper on fine-tuning unsupervised language models
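At its core, the standard DPO objective the description refers to fits in a few lines of PyTorch; this is an illustrative sketch with hypothetical tensor names, not the repo's actual code:

```python
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    """DPO: prefer chosen over rejected responses, measured relative
    to a frozen reference model, with no explicit reward model."""
    chosen_rewards = beta * (policy_chosen_logps - ref_chosen_logps)
    rejected_rewards = beta * (policy_rejected_logps - ref_rejected_logps)
    # Maximize the log-sigmoid of the reward margin
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()
```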
nanoPPO
An efficient implementation of the Proximal Policy Optimization (PPO) algorithm with linear and attention-based policies for reinforcement learning.
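The clipped surrogate objective at the heart of PPO can be sketched as follows (illustrative only, with hypothetical tensor names):

```python
import torch

def ppo_clip_loss(new_logps, old_logps, advantages, clip_eps=0.2):
    """PPO policy loss: clip the probability ratio so a single
    update cannot move the policy too far from the old one."""
    ratio = torch.exp(new_logps - old_logps)  # pi_new(a|s) / pi_old(a|s)
    clipped = torch.clamp(ratio, 1 - clip_eps, 1 + clip_eps)
    # Take the pessimistic bound, negated for gradient descent
    return -torch.min(ratio * advantages, clipped * advantages).mean()
```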
nanoTransformer
A PyTorch-based library featuring an efficiently implemented Transformer model. The core of the attention mechanisms is powered by torch.einsum, ensuring clean, readable, and highly optimized tensor operations.
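As a sketch of the idea (not the repo's exact code), scaled dot-product attention written with torch.einsum looks like this:

```python
import torch

def einsum_attention(q, k, v):
    """q, k, v: (batch, heads, seq, dim). The einsum subscripts make
    the contraction over the key dimension explicit and readable."""
    scale = q.shape[-1] ** -0.5
    scores = torch.einsum("bhqd,bhkd->bhqk", q, k) * scale
    weights = scores.softmax(dim=-1)
    return torch.einsum("bhqk,bhkd->bhqd", weights, v)
```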
nChain
A flexible and efficient implementation for creating LLM bots over extensible datasets.
xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow
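For reference, the library's scikit-learn-style Python API in its simplest form (dataset and hyperparameters here are arbitrary):

```python
import xgboost as xgb
from sklearn.datasets import make_regression
from sklearn.model_selection import train_test_split

X, y = make_regression(n_samples=1000, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Gradient-boosted trees behind a familiar fit/predict interface
model = xgb.XGBRegressor(n_estimators=200, max_depth=4, learning_rate=0.1)
model.fit(X_train, y_train)
print(model.score(X_test, y_test))  # R^2 on held-out data
```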
autogen
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
jamesliu's Repositories
jamesliu/nanoDPO
A nimble and innovative implementation of the Direct Preference Optimization (DPO) algorithm with Causal Transformer and LSTM models, inspired by the DPO paper on fine-tuning unsupervised language models
jamesliu/nChain
A flexible and efficient implementation for creating LLM bots over extensible datasets.
jamesliu/amago
A simple and scalable agent for training adaptive policies with sequence-based RL
jamesliu/anthropic-cookbook
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
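The recipes build on the Anthropic Python SDK; a minimal call looks roughly like this (model id and prompt are placeholders):

```python
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment
message = client.messages.create(
    model="claude-3-5-sonnet-20241022",
    max_tokens=256,
    messages=[{"role": "user", "content": "Explain DPO in two sentences."}],
)
print(message.content[0].text)
```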
jamesliu/autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
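A minimal two-agent chat, assuming the pre-0.4 pyautogen API that this fork tracks (config values are placeholders):

```python
from autogen import AssistantAgent, UserProxyAgent

config_list = [{"model": "gpt-4", "api_key": "YOUR_KEY"}]  # placeholder credentials
assistant = AssistantAgent("assistant", llm_config={"config_list": config_list})
user_proxy = UserProxyAgent("user_proxy", human_input_mode="NEVER",
                            code_execution_config=False)
# The proxy relays the task; the assistant replies until termination
user_proxy.initiate_chat(assistant, message="Plot a sine wave with matplotlib.")
```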
jamesliu/bayjarvis-app
jamesliu/devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.
jamesliu/lag-llama
jamesliu/litgpt
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
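The LoRA fine-tuning it supports boils down to adding a trainable low-rank update to frozen pretrained weights; a generic sketch (not litgpt's implementation):

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen pretrained linear layer plus a trainable low-rank delta."""
    def __init__(self, base: nn.Linear, rank=8, alpha=16):
        super().__init__()
        self.base = base
        self.base.weight.requires_grad_(False)   # freeze pretrained weights
        self.lora_a = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(base.out_features, rank))
        self.scaling = alpha / rank

    def forward(self, x):
        # y = W x + scaling * B A x; only A and B receive gradients
        return self.base(x) + (x @ self.lora_a.T @ self.lora_b.T) * self.scaling
```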
jamesliu/LLaMA-Factory
Unify Efficient Fine-tuning of 100+ LLMs
jamesliu/llm-foundry
LLM training code for MosaicML foundation models
jamesliu/llm.c
LLM training in simple, raw C/CUDA
jamesliu/LLMLingua
Compresses prompts and the KV cache to speed up LLM inference and sharpen the model's perception of key information, achieving up to 20x compression with minimal performance loss.
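Typical usage follows the project's README; treat the argument names below as assumptions from memory rather than a checked signature:

```python
from llmlingua import PromptCompressor

long_prompt = "<your long prompt here>"  # placeholder input

compressor = PromptCompressor()          # downloads the default compression model
result = compressor.compress_prompt(
    long_prompt,
    target_token=300,                    # rough token budget after compression
)
print(result["compressed_prompt"])
```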
jamesliu/llmtime
jamesliu/long-range-arena
Long Range Arena for Benchmarking Efficient Transformers
jamesliu/magicoder
Magicoder: Source Code Is All You Need
jamesliu/network_architecture
jamesliu/OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
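The core trick, top-k token routing across expert MLPs, can be sketched generically (illustrative, not OpenMoE's code):

```python
import torch
import torch.nn as nn

class TopKMoE(nn.Module):
    """Route each token to its top-k experts and mix their outputs."""
    def __init__(self, dim, n_experts=8, k=2):
        super().__init__()
        self.router = nn.Linear(dim, n_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim))
            for _ in range(n_experts)
        )
        self.k = k

    def forward(self, x):                         # x: (tokens, dim)
        gates = self.router(x).softmax(dim=-1)    # routing probabilities
        weights, idx = gates.topk(self.k, dim=-1)
        out = torch.zeros_like(x)
        for slot in range(self.k):                # dense loop: clear, not fast
            for e, expert in enumerate(self.experts):
                mask = idx[:, slot] == e
                if mask.any():
                    out[mask] += weights[mask, slot, None] * expert(x[mask])
        return out
```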
jamesliu/package-stats
jamesliu/Pearl
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.
jamesliu/pypi-bayjarvis-packages
jamesliu/recommendation_system
jamesliu/repopack
📦 Repopack is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, and Gemini.
jamesliu/robo-advisor-with-python
jamesliu/routerbench
The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System
jamesliu/ScaleLLM
A high-performance inference system for large language models, designed for production environments.
jamesliu/SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
jamesliu/Time-LLM
[ICLR 2024] Official implementation of "Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
jamesliu/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
jamesliu/Transformers_Are_What_You_Dont_Need
A repository showing why transformers don't work well in time series forecasting and showcasing the best SOTA non-transformer models.