Samox1
Ph.D. student, engineer (photonics) & data/ml enthusiast
Warsaw University of TechnologyPoland
Samox1's Stars
HKUDS/LightRAG
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
gpt-omni/mini-omni2
Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。
LLaVA-VL/LLaVA-NeXT
patronus-ai/financebench
crewAIInc/crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
EnVision-Research/Lotus
Official Implementation of LOTUS: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
deepseek-ai/Janus
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation
corbt/agent.exe
NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
neulab/Pangea
This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES"
VikParuchuri/marker
Convert PDF to markdown quickly with high accuracy
PacktPublishing/LLM-Engineers-Handbook
The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices
pola-rs/polars-benchmark
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
mit-han-lab/duo-attention
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
apple/ml-depth-pro
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
thunlp/LLaVA-UHD
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
RLHF-V/RLAIF-V
RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness
saurabhlalsaxena/Perplexity-Clone-v0.1
This is a repository which uses LangChain LangGraph and DuckduckGo to create a Perplexity Clone
HandsOnLLM/Hands-On-Large-Language-Models
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
openai/openai-realtime-console
React app for inspecting, building and debugging with the Realtime API
open-compass/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
mlfoundations/open_clip
An open source implementation of CLIP.
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
GbotHQ/ocr-dataset-rendering
ShareGPT4Omni/ShareGPT4Video
[NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
meta-llama/PurpleLlama
Set of tools to assess and improve LLM security.
ollama/ollama-python
Ollama Python library