golkir's Stars
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
py-why/dowhy
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
adam2392/causal-networkx
A lightweight graph class library relevant for causal inference.
RManLuo/reasoning-on-graphs
Official Implementation of ICLR 2024 paper: "Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning"
spcl/graph-of-thoughts
Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
golkir/llama2-7b-minidatabricks
Finetuning Llama2 with modified Databricks Dolly dataset
facebookresearch/pytorchvideo
A deep learning library for video understanding research.
microsoft/markitdown
Python tool for converting files and office documents to Markdown.
limacv/GaussianSplattingViewer
Tiny Gaussian Splatting Viewer
ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
cagostino/npcsh
The AI toolkit for the AI developer
TheBlewish/Automated-AI-Web-Researcher-Ollama
A python program that turns an LLM, running on Ollama, into an automated researcher, which will with a single query determine focus areas to investigate, do websearches and scrape content from various relevant websites and do research for you all on its own! And more, not limited to but including saving the findings for you!
huggingface/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Mozer/talk-llama-fast
Port of OpenAI's Whisper model in C/C++ with xtts and wav2lip
snakers4/silero-models
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
bklieger-groq/g1
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
1989Ryan/llm-mcts
[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.
av/harbor
Effortlessly run LLM backends, APIs, frontends, and services with one command.
open-webui/open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
simple-bench/SimpleBench
microsoft/BitNet
Official inference framework for 1-bit LLMs
cpldcpu/MisguidedAttention
A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information
LiveBench/LiveBench
LiveBench: A Challenging, Contamination-Free LLM Benchmark
hendrycks/test
Measuring Massive Multitask Language Understanding | ICLR 2021
fchollet/ARC-AGI
The Abstraction and Reasoning Corpus
PKU-YuanGroup/LLaVA-CoT
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
ant-research/MagicQuill
[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
QwenLM/Qwen2.5-Coder
Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.