tjhwk's Stars
practical-tutorials/project-based-learning
Curated list of project-based tutorials
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
GokuMohandas/Made-With-ML
Learn how to design, develop, deploy and iterate on production-grade ML applications.
OI-wiki/OI-wiki
:star2: Wiki of OI / ICPC for everyone. (某大型游戏线上攻略,内含炫酷算术魔法)
AlDanial/cloc
cloc counts blank lines, comment lines, and physical lines of source code in many programming languages.
mlflow/mlflow
Open source platform for the machine learning lifecycle
joke2k/faker
Faker is a Python package that generates fake data for you.
dromara/Sa-Token
一个轻量级 Java 权限认证框架,让鉴权变得简单、优雅!—— 登录认证、权限认证、分布式Session会话、微服务网关鉴权、单点登录、OAuth2.0
NLP-LOVE/ML-NLP
此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
datawhalechina/leedl-tutorial
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.
dagger/dagger
An engine to run your pipelines in containers
delta-io/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
apache/zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
orchest/orchest
Build data pipelines, the easy way 🛠️
tencentmusic/cube-studio
cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,支持国产cpu/gpu/npu芯片,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano分布式
alldatacenter/alldata
🔥🔥 AllData可定义数据中台,以数据平台为底座,以数据中台为桥梁,以机器学习平台为工厂,以大模型应用为上游产品,提供全链路数字化解决方案。采购商业版、加入技术社区:https://docs.qq.com/doc/DVHlkSEtvVXVCdEFo
pac4j/pac4j
Security engine for Java (authentication, authorization, multi frameworks): OAuth, CAS, SAML, OpenID Connect, LDAP, JWT...
curiousily/Getting-Things-Done-with-Pytorch
Jupyter Notebook tutorials on solving real-world problems with Machine Learning & Deep Learning using PyTorch. Topics: Face detection with Detectron 2, Time Series anomaly detection with LSTM Autoencoders, Object Detection with YOLO v5, Build your first Neural Network, Time Series forecasting for Coronavirus daily cases, Sentiment Analysis with BER
apache/kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
langchain-ai/langserve
LangServe 🦜️🏓
birdLark/LarkMidTable
LarkMidTable 是一站式开源的数据中台,实现中台的 基础建设,数据治理,数据开发,监控告警,数据服务,数据的可视化,实现高效赋能数据前台并提供数据服务的产品。
cloudpipe/cloudpickle
Extended pickling support for Python objects
insitro/redun
Yet another redundant workflow engine
AndyJZhao/GLEM
Spico197/Mirror
🪞A powerful toolkit for almost all the Information Extraction tasks.
xusenlinzy/lit-ie
A training and inference framework for open ner and re models! 信息抽取模型的统一训练和推理框架,包含丰富的开源SOTA模型
getindata/streaming-jupyter-integrations
kundtx/Career_Platform_with_Demo
Source Code and Demo for paper CareerMiner: Automatic Extraction of Professional Network from Large Chinese Resume Data