Frankluox

Ph.D student | AIGC, Agents, LLMs

UESTCChengdu, China

Frankluox's Stars

RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python33.1k 202 1.2k3.8k
2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python31k 179 5153.4k
princeton-nlp/SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.
Language:Python13.4k 97 3741.3k
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Language:Python12.1k 100 529847
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
Language:Python7.6k 49 648835
lllyasviel/Omost
Your image is almost there!
Language:Python7.2k 44 78418
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
Language:Jupyter Notebook6.9k 74 202440
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Language:Python6.5k 70 104579
arcee-ai/mergekit
Tools for merging pretrained large language models.
Language:Python4.6k 50 297403
facebookresearch/schedule_free
Schedule-Free Optimization in PyTorch
Language:Python1.8k 16 2963
NVlabs/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Language:Python1.8k 27 119146
google-deepmind/penzai
A JAX research toolkit for building, editing, and visualizing neural networks.
Language:Python1.6k 17 1550
dora-rs/dora
DORA (Dataflow-Oriented Robotic Architecture) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low latency, composable, and distributed dataflow capabilities. Applications are modeled as directed graphs, also referred to as pipelines.
Language:Rust1.5k 30 13677
google-deepmind/mujoco_menagerie
A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.
Language:Jupyter Notebook1.3k 26 67174
PufferAI/PufferLib
Simplifying reinforcement learning for complex game environments
Language:Python1.1k 5 1042
prometheus-eval/prometheus-eval
Evaluate your LLM's response with Prometheus and GPT4 💯
Language:Python762 3 3047
IDEA-Research/Grounding-DINO-1.5-API
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
Language:Python728 11 4021
OpenTeleVision/TeleVision
[CoRL 2024] Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Language:Python578 8 2956
robfiras/loco-mujoco
Imitation learning benchmark focusing on complex locomotion tasks using MuJoCo.
Language:Python536 8 3646
robocasa/robocasa
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
Language:Python516 9 5836
AIGText/Glyph-ByT5
[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering""
Language:Jupyter Notebook489 17 1621
OpenRobotLab/GRUtopia
GRUtopia: Dream General Robots in a City at Scale
Language:Python469 11 1922
maitrix-org/Pandora
Pandora: Towards General World Model with Natural Language Actions and Video States
Language:Python465 17 733
bigcode-project/starcoder2-self-align
StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation
Language:Python221 6 614
AILab-CVC/CV-VAE
CV-VAE: A Compatible Video VAE for Latent Generative Video Models
Language:Jupyter Notebook211 14 136
mihirp1998/VADER
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.
Language:Python195 7 1415
google-research/android_world
AndroidWorld is an environment and benchmark for autonomous agents
Language:Python95 3 65
jonzamora/awesome-robot-learning-envs
A list of awesome and popular robot learning environments
87 1 01
mathvision-cuhk/MATH-V
MATH-Vision dataset and code to measure Multimodal Mathematical Reasoning capabilities.
Language:Python57 1 24
DCDmllm/MorphTokens
Language:Python40 2 41

Frankluox

Frankluox's Stars

RVC-Boss/GPT-SoVITS

2noise/ChatTTS

princeton-nlp/SWE-agent

OpenBMB/MiniCPM-V

axolotl-ai-cloud/axolotl

lllyasviel/Omost

OpenBMB/MiniCPM

huggingface/lerobot

arcee-ai/mergekit

facebookresearch/schedule_free

NVlabs/VILA

google-deepmind/penzai

dora-rs/dora

google-deepmind/mujoco_menagerie

PufferAI/PufferLib

prometheus-eval/prometheus-eval

IDEA-Research/Grounding-DINO-1.5-API

OpenTeleVision/TeleVision

robfiras/loco-mujoco

robocasa/robocasa

AIGText/Glyph-ByT5

OpenRobotLab/GRUtopia

maitrix-org/Pandora

bigcode-project/starcoder2-self-align

AILab-CVC/CV-VAE

mihirp1998/VADER

google-research/android_world

jonzamora/awesome-robot-learning-envs

mathvision-cuhk/MATH-V

DCDmllm/MorphTokens