h-terao's Stars
facebookresearch/vggt
[CVPR 2025] VGGT: Visual Geometry Grounded Transformer
incluud/accessible-astro-components
A collection of accessible components for Astro projects with built-in ARIA attributes, keyboard navigation and interactive elements. Easy to implement and customize to your needs.
OpenBMB/MiniCPM-o
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
khoj-ai/khoj
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
microsoft/AIOpsLab
A holistic framework to enable the design, development, and evaluation of autonomous AIOps agents.
f0uriest/interpax
Interpolation and function approximation with JAX
facebookresearch/lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
jax-ml/oryx
Oryx is a library for probabilistic programming and deep learning built on top of Jax.
uezo/ChatdollKit
ChatdollKit enables you to make your 3D model into a chatbot
google-research/self-organising-systems
lllyasviel/Omost
Your image is almost there!
HigherOrderCO/Bend
A massively parallel, high-level programming language
wetdog/wavenext_pytorch
Unofficial implementation of wavenext vocoder
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
RabbitBoss/Awesome-Realistic-Semi-Supervised-Learning
An awesome paper list of Semi-Supervised Learning under realistic settings.
reservoirpy/reservoirpy
A simple and flexible code for Reservoir Computing architectures like Echo State Networks
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
xxh/xxh
🚀 Bring your favorite shell wherever you go through the ssh. Xonsh shell, fish, zsh, osquery and so on.
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
microsoft/folx
Implementation of Forward Laplacian algorithm in JAX
numediart/EmoV-DB
The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems
jax-ml/bayeux
State of the art inference for your bayesian models.
bshall/ZeroSpeech
VQ-VAE for Acoustic Unit Discovery and Voice Conversion
e3nn/e3nn-jax
jax library for E3 Equivariant Neural Networks
MarkFzp/act-plus-plus
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
cumulo-autumn/StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
LouisShark/chatgpt_system_prompt
A collection of GPT system prompts and various prompt injection/leaking knowledge.
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.