Pinned Repositories
aws-autonomous-driving-data-lake-image-extraction-pipeline-from-ros-bagfiles
This workshop will familiarize you with some of the key steps towards building an autonomous driving data lake and extracting images from ROS bag files. Using these images, you will be able label them using SageMaker Ground Truth and fine-tuning a Machine Learning Model to detect cars.
InternLM
Official release of InternLM2.5 base and chat models. 1M context support
opendatalab-python-sdk
SDK of OpenDataLab - https://opendatalab.org.cn
30-Days-Of-Python
30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than100 days, follow your own pace.
AISystem
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
algo
数据结构和算法必知必会的50个代码实现
ColossalAI
Making big AI models cheaper, easier, and more scalable
flink
Apache Flink
hikCam
thzImageProcess
using python opencv and numpy module to filter,edge detection,find countour
varuy322's Repositories
varuy322/AISystem
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
varuy322/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
varuy322/Awesome-DIffusionModels-paper
A curasted list of papers with the topic of Diffusion Models for Multi-Modal
varuy322/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
varuy322/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
varuy322/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
varuy322/coding-interview-university
A complete computer science study plan to become a software engineer.
varuy322/data-algorithms-with-spark
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
varuy322/data-selection-survey
This is a collection of research papers for A Survey on Data Selection for Language Models
varuy322/datacomp
DataComp: In search of the next generation of multimodal datasets
varuy322/faiss
A library for efficient similarity search and clustering of dense vectors.
varuy322/hello-algo
《Hello 算法》:动画图解、一键运行的数据结构与算法教程,支持 Python, C++, Java, C#, Go, Swift, JS, TS, Dart, Rust, C, Zig 等语言。English edition ongoing
varuy322/InternLM
InternLM has open-sourced a 7 billion parameter base model, a chat model tailored for practical scenarios and the training system.
varuy322/LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
varuy322/llama-recipes
Examples and recipes for Llama 2 model
varuy322/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
varuy322/llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
varuy322/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
varuy322/LLMs-from-scratch
Implementing a ChatGPT-like LLM from scratch, step by step
varuy322/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
varuy322/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLM
varuy322/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
varuy322/mmc4
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
varuy322/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
varuy322/QuRating
Select LM Training Data Based on Qualitative Aspects of Text
varuy322/rags
Build ChatGPT over your data, all with natural language
varuy322/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
varuy322/varuy322
varuy322/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
varuy322/Yi
A series of large language models trained from scratch by developers @01-ai