andr0idsensei
I'm a software engineer, currently working with Deep Learning for Computer Vision and with interests in Robotics and applied AI.
Bosch, Cluj-Napoca
andr0idsensei's Stars
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
NVlabs/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
turingmotors/heron
Tsinghua-MARS-Lab/futr3d
Code for paper: FUTR3D: a unified sensor fusion framework for 3D detection
HorizonRobotics/Sparse4D
quic/ai-hub-models
The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
yuweihao/MM-Vet
MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)
darrenburns/elia
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.
google-research-datasets/conceptual-captions
Conceptual Captions is a dataset containing (image-URL, caption) pairs designed for the training and evaluation of machine learned image captioning systems.
uncbiag/Awesome-Foundation-Models
A curated list of foundation models for vision and language tasks
google-research/omniglue
Code release for CVPR'24 submission 'OmniGlue'
facebookresearch/habitat-sim
A flexible, high-performance 3D simulator for Embodied AI research.
OpenBMB/ChatDev
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
YuchuanTian/DiJiang
[ICML'24 Oral] The official code of "DiJiang: Efficient Large Language Models through Compact Kernelization", a novel DCT-based linear attention mechanism.
mhamilton723/FeatUp
Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" ICLR 2024
zed-industries/zed
Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.
Future-Scholars/paperlib
An open-source academic paper management tool.
bonsai-rx/bonsai
The compiler, IDE, and standard library for the Bonsai visual programming language for reactive systems
Thinklab-SJTU/Awesome-LLM4AD
A curated list of awesome LLM for Autonomous Driving resources (continually updated)
huggingface/candle
Minimalist ML framework for Rust
THUDM/CogVLM
a state-of-the-art-level open visual language model | multimodal pre-trained model
facebookresearch/ijepa
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."
lhoyer/MIC
[CVPR23] Official Implementation of MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation
DCDmllm/Cheetah
vitoplantamura/OnnxStream
Lightweight inference library for ONNX files, written in C++. It can run SDXL on a RPI Zero 2 but also Mistral 7B on desktops and servers.
alaamaalouf/FollowAnything
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
ahmedbahaaeldin/From-0-to-Research-Scientist-resources-guide
Detailed and tailored guide for undergraduate students or anybody who wants to dig deep into the field of AI with a solid foundation.