dtotsila
Working on LLMs/VLMs and Robotics. Doctoral Researcher, Centre Inria de l'Université de Lorraine
InriaNancy, France
dtotsila's Stars
google/gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
Vision-CAIR/VisualGPT
VisualGPT, CVPR 2022 Proceeding, GPT as a decoder for vision-language models
Uberi/speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
mallorbc/whisper_mic
Project that allows one to use a microphone with OpenAI whisper.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
NVlabs/BundleSDF
[CVPR 2023] BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects
Zaloog/kanban-python
Kanban Terminal App written in Python
apple/ml-mgie
time-series-foundation-models/lag-llama
Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting
hucebot/mj_cll
A library for Inverse/Forward Kinematics of Hybrid Serial-Parallel Closed Chains based on Mujoco.
elbuco1/AttentionMechanismsTrajectoryPrediction
In this repository, one can find the code for my master's thesis project. The main goal of the project was to study and improve attention mechanisms for trajectory prediction of moving agents.
rasbt/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
unslothai/unsloth
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
f3rm/f3rm
F3RM: Feature Fields for Robotic Manipulation. Official repo for the paper "Distilled Feature Fields Enable Few-Shot Language-Guided Manipulation" (CoRL 2023).
j-min/CLIP-Caption-Reward
PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)
pharmapsychotic/clip-interrogator
Image to prompt with BLIP and CLIP
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
ollama/ollama-python
Ollama Python library
mlfoundations/open_clip
An open source implementation of CLIP.
NVlabs/FoundationPose
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
jxnl/instructor
structured outputs for llms
outlines-dev/outlines
Structured Text Generation
1rgs/jsonformer
A Bulletproof Way to Generate Structured JSON from Language Models
wenbowen123/iros20-6d-pose-tracking
[IROS 2020] se(3)-TrackNet: Data-driven 6D Pose Tracking by Calibrating Image Residuals in Synthetic Domains
vimalabs/VIMA
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
huggingface/autotrain-advanced
🤗 AutoTrain Advanced