Pinned Repositories
01
The open-source language model computer
1d-tokenizer
This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation
2xDaniel
AGI House Hackathon Demo 2
aaiela
actionsheets-streamlit
Streamlit front-end for the actionsheets package
AdamW-Triton-PyTorch
Can AdamW written in Triton be as performat as fused CUDA impl?
llmware
Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models.
paper-qa
LLM Chain for answering questions from documents with citations
techthiyanes's Repositories
techthiyanes/alphafold3_1
AlphaFold 3 inference pipeline.
techthiyanes/ASR-TTS-paper-daily
Update ASR paper everyday
techthiyanes/browser-use
Open-Source Web Automation library with any LLM
techthiyanes/chonkie
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
techthiyanes/Cosmos-Tokenizer
A suite of image and video neural tokenizers
techthiyanes/data-portraits
Documenting large text datasets 🖼️ 📚
techthiyanes/datachain
AI-data warehouse to enrich, transform and analyze data from cloud storages
techthiyanes/DTLR
Handwritten Text Recognition and Character Detection
techthiyanes/Fast-LLM
techthiyanes/FlagAI
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.
techthiyanes/inseq
Interpretability for sequence generation models 🔍
techthiyanes/Integuru
The first AI agent that builds third-party integrations through reverse engineering platforms' internal APIs.
techthiyanes/LLaVA-KD
techthiyanes/LLMs-from-scratch
Implementing a ChatGPT-like LLM from scratch, step by step
techthiyanes/localGPT-Vision
Chat with your documents using Vision Language Models. This repo implements an End to End RAG pipeline with both local and proprietary VLMs
techthiyanes/MemGPT
Teaching LLMs memory management for unbounded context 📚🦙
techthiyanes/neo4j-runway
End to end solution for migrating CSV data into a Neo4j graph using an LLM for the data discovery and graph data modeling stages.
techthiyanes/nunchaku
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
techthiyanes/Ollama_vision_rag
ollama vision rag
techthiyanes/OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
techthiyanes/optimizers_dist_shampoo
For optimization algorithm research and development.
techthiyanes/outspeed
Python SDK to build realtime AI applications on voice and video.
techthiyanes/Protein_Diffusion
Diffusion Model for Protein Structure Generation
techthiyanes/pytester
Python Testing for Databricks
techthiyanes/sector
techthiyanes/seqio
Task-based datasets, preprocessing, and evaluation for sequence models.
techthiyanes/SpeechSplit
Unsupervised Speech Decomposition Via Triple Information Bottleneck
techthiyanes/swarms-examples
A vast array of examples for the enterprise-grade and production-ready swarms framework.
techthiyanes/TokenFormer
Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
techthiyanes/vidore-benchmark
Vision Document Retrieval (ViDoRe): Benchmark 👀. Evaluation code for the "ColPali: Efficient Document Retrieval with Vision Language Models" paper.