HuaizhengZhang's Stars
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
localstack/localstack
💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
langflow-ai/langflow
Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
osquery/osquery
SQL powered operating system instrumentation, monitoring, and analytics.
joonspk-research/generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps to showcase Meta Llama for WhatsApp & Messenger.
ludwig-ai/ludwig
Low-code framework for building custom LLMs, neural networks, and other AI models
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
datahub-project/datahub
The Metadata Platform for your Data and AI Stack
great-expectations/great_expectations
Always know what to expect from your data.
Mooler0410/LLMsPracticalGuide
A curated list of practical guide resources for LLMs (LLMs Tree, Examples, Papers)
datafuselabs/databend
Data, Analytics & AI. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com
tobymao/sqlglot
Python SQL Parser and Transpiler
gkamradt/langchain-tutorials
Overview and tutorial of the LangChain Library
open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
Doragd/Algorithm-Practice-in-Industry
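A collection of industry practice articles on search, recommendation, advertising, user growth, and related topics (sources: Zhihu, DataFunTalk, technical WeChat official accounts)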
DAGWorks-Inc/hamilton
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage/tracing and metadata. Runs and scales everywhere Python does.
MarquezProject/marquez
Collect, aggregate, and visualize a data ecosystem's metadata
OpenLineage/OpenLineage
An Open Standard for lineage metadata collection
guyulongcs/Awesome-Deep-Learning-Papers-for-Search-Recommendation-Advertising
Awesome Deep Learning papers for industrial Search, Recommendation and Advertisement. They focus on Embedding, Matching, Ranking (CTR/CVR prediction), Post Ranking, Large Model (Generative Recommendation, LLM), Transfer learning, Reinforcement Learning and so on.
quiltdata/quilt
Quilt is a data mesh for connecting people with actionable data
evalplus/evalplus
Rigorous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024
WeOpenML/PandaLM
ECNU-ICALK/EduChat
An open-source Chinese-English educational chat model from ICALK, East China Normal University (general-purpose base model, GPU deployment, data cleaning). Acknowledgments: LLaMA, MOSS, BELLE, Ziya, vLLM
OpenMOSS/CoLLiE
Collaborative Training of Large Language Models in an Efficient Way
cli99/llm-analysis
Latency and Memory Analysis of Transformer Models for Training and Inference
astronomer/astro-sdk
Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
illuminascent/img2lut
Creates a 3D LUT from an image
MSNLAB/ActVideo
Efficient edge video analytics platform via active continuous learning.