robertalanm

Building Internet Scale Machine Intelligence!

@manifold-inc Texas

Pinned Repositories

targon
A library for building subnets with the manifold reward stack
Language: Python 25 10 124
bittensor
Internet-scale Neural Networks
Language: Python 878 34 287306
bittensor-js
Bittensor API, but for web applications
Language: TypeScript 12
alpaca-weight
Train LLaMA with LoRA on one 4090 and merge the LoRA weights to work as Stanford Alpaca.
Language: Python 1 0 00
CodingSubnet
1 1 00
DALLE-2
fun ai work
Language: Python 2 1 00
langchain
⚡ Building applications with LLMs through composability ⚡
Language: Python 1 0 00
reward-modeling
Language: Python 1 0 00
safe-rlhf
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language: Python 2 1 00
text-generation-inference
Large Language Model Text Generation Inference
Language: Python 1 0 00

robertalanm's Repositories

robertalanm/safe-rlhf
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language: Python 2 1 00
robertalanm/alpaca-weight
Train LLaMA with LoRA on one 4090 and merge the LoRA weights to work as Stanford Alpaca.
Language: Python 1 0 00
robertalanm/CodingSubnet
1 1 00
robertalanm/langchain
⚡ Building applications with LLMs through composability ⚡
Language: Python 1 0 00
robertalanm/reward-modeling
Language: Python 1 0 00
robertalanm/text-generation-inference
Large Language Model Text Generation Inference
Language: Python 1 0 00
robertalanm/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Language: Python 1 0 00
robertalanm/airoboros
Customizable implementation of the self-instruct paper.
Language: Python 0 0
robertalanm/alpaca-lora
Code for reproducing the Stanford Alpaca InstructLLaMA result on consumer hardware
Language: Jupyter Notebook
robertalanm/autocrit
A repository for transformer critique learning and generation
Language: Python 0 0
robertalanm/axolotl
Go ahead and axolotl questions
Language: Python
robertalanm/ColossalAI
Making large AI models cheaper, faster and more accessible
Language: Python 0 0
robertalanm/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Language: Python
robertalanm/discord
Language: Python 1 0
robertalanm/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Language: Python 0 0
robertalanm/H3
Language Modeling with the H3 State Space Model
Language: Assembly 0 0
robertalanm/langflow
⛓️ LangFlow is a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows.
Language: TypeScript 0 0
robertalanm/langfuse
open-source observability for LLM applications
Language: TypeScript 0 0
robertalanm/langfuse-python
Language: Python 0 0
robertalanm/llama-trl
LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA
Language: Jupyter Notebook 0 0
robertalanm/minimal-llama
Language: Python 0 0
robertalanm/OpenLLaMA2
A Ray-based High-performance LLaMA2 RLHF framework
Language: Python 0 0
robertalanm/opentensorAI-connector-template
Language: JavaScript 0 0
robertalanm/orca
Experiments into reproducing orca
1 0
robertalanm/pfrl
PFRL: a PyTorch-based deep reinforcement learning library
Language: Python 0 0
robertalanm/raodottown
website for rao.town
Language: JavaScript 1 0
robertalanm/substrate-indexer
indexer for substrate chain (bt)
Language: TypeScript 1 0
robertalanm/t-jepa
Language: Python 1 0
robertalanm/validators
Repository for bittensor validators
Language: Python 0 0
robertalanm/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language: Python 0 0