Pinned Repositories
audit-logs-processor
Application to process logs from MapR Audit Streams and send it to OpenTSDB
Awesome-System-for-Machine-Learning
A curated list of research in machine learning system. I also summarize some papers if I think they are really interesting.
cm_api
Cloudera Manager API Client
CUDCOS_auto_deployment
DCOS自动部署
customer360
Customer 360 analytics powered by MapR
dcos
DC/OS Build and Release tools
desktop-app
Leanote's desktop app, based on Electron(atom-shell), NW(node-webkit) http://leanote.org
ds-cheatsheets
List of Data Science Cheatsheets to rule the world
dsri-documentation
Documentation for the Data Science Research Infrastructure at Maastricht University (OpenShift with MapR cluster)
gas_station
流量加油站
corersky's Repositories
corersky/ai00_server
A localized open-source AI server that is better than ChatGPT.
corersky/AISystem
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
corersky/awesome-DeepLearning
深度学习入门课、资深课、特色课、学术案例、产业实践案例、深度学习知识百科及面试题库The course, case and knowledge of Deep Learning and AI
corersky/awesome_Chinese_medical_NLP
中文医学NLP公开资源整理:术语集/语料库/词向量/预训练模型/知识图谱/命名实体识别/QA/信息抽取/模型/论文/etc
corersky/CBLUE
中文医疗信息处理基准CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark
corersky/ChatGPT
🔮 ChatGPT Desktop Application (Mac, Windows and Linux)
corersky/ChatGPT-Midjourney
🍭 一键拥有你自己的 ChatGPT+Midjourney 网页服务 | Own your own ChatGPT+Midjourney web service with one click
corersky/ChatGPT-Next-Web
A well-designed cross-platform ChatGPT UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT 应用。
corersky/Chinese-medical-dialogue-data
Chinese medical dialogue data 中文医疗对话数据集
corersky/CSGHub
CSGHub is an opensource large model assets platform just like on-premise huggingface which helps to manage datasets, model files, codes and more. CSGHub是一个开源、可信的大模型资产管理平台,可帮助用户治理LLM和LLM应用生命周期中涉及到的资产(数据集、模型文件、代码等)。CSGHub提供类似私有化的Huggingface功能,以类似OpenStack Glance管理虚拟机镜像、Harbor管理容器镜像以及Sonatype Nexus管理制品的方式,实现对LLM资产的管理。欢迎关注反馈和Star⭐️
corersky/cube-studio
cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,支持国产cpu/gpu/npu芯片,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano分布式
corersky/CUDA-Learn-Notes
🎉CUDA/C++ 笔记 / 技术博客: fp32、fp16/bf16、fp8/int8、flash_attn、sgemm、sgemv、warp/block reduce、dot prod、elementwise、softmax、layernorm、rmsnorm、hist etc.
corersky/CVPR2023-Papers-with-Code
CVPR 2023 论文和开源项目合集
corersky/daily-paper-computer-vision
记录每天整理的计算机视觉/深度学习/机器学习相关方向的论文
corersky/Deep-Learning-Interview-Book
深度学习面试宝典(含数学、机器学习、深度学习、计算机视觉、自然语言处理和SLAM等方向)
corersky/DragGAN
Online Demo and Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (DragGAN 全功能实现,在线Demo,本地部署试用,代码、模型已全部开源,支持Windows, macOS, Linux)
corersky/Firefly
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
corersky/flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
corersky/generative-ai-on-aws
Generative AI on AWS
corersky/Huggingface_Toturials
bert-base-chinese example
corersky/ignite
Apache Ignite
corersky/jiron-cloud
该项目整合了多款优秀的开源产品,构建了一个功能全面的数据开发平台。平台提供了强大的数据集成、数据开发、数据查询、数据服务、数据质量管理、工作流调度和元数据管理功能。#dinky #dolphinscheduler #datavines #flinkcdc #openmetadata #flink #数据开发 #数据平台 # 数据开发平台 #大数据
corersky/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
corersky/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
corersky/LLMHub
LLM Study Note
corersky/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
corersky/PromptCBLUE
PromptCBLUE: a large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain in Chinese
corersky/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
corersky/step_into_llm
MindSpore online courses: Step into LLM
corersky/transformers-code
手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube