Pinned Repositories
ahocorasick-python
AC自动机python的实现,并进行了优化。 主要修复了 查询不准确的问题。
Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. Meanwhile, we created a new branch to build a Tabular LLM.(我们分别统一了丰富的IFT数据(如CoT数据,目前仍不断扩充)、多种训练效率方法(如lora,p-tuning)以及多种LLMs,三个层面上的接口,打造方便研究人员上手的LLM-IFT研究平台。同时tabular_llm分支构建了面向表格智能任务的LLM。
alpaca_chinese_dataset
人工精调的中文对话数据集和一段chatglm的微调代码
Chinese-Word-Vectors
100+ Chinese Word Vectors 上百种预训练中文词向量
kryo
Java serialization and cloning: fast, efficient, automatic
logistic-regression
A simple implementation of logisitic regression in Java
PCA
The practise of principal component analysis
Redis-Code
redis键值数据库源码分析
tfidf
A generic Tf-Idf utility with example code that works on n-grams extracted from a text document.
weka
weka mirror with git — http://www.cs.waikato.ac.nz/ml/weka/
ericshijian's Repositories
ericshijian/Chinese-Word-Vectors
100+ Chinese Word Vectors 上百种预训练中文词向量
ericshijian/ahocorasick-python
AC自动机python的实现,并进行了优化。 主要修复了 查询不准确的问题。
ericshijian/Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. Meanwhile, we created a new branch to build a Tabular LLM.(我们分别统一了丰富的IFT数据(如CoT数据,目前仍不断扩充)、多种训练效率方法(如lora,p-tuning)以及多种LLMs,三个层面上的接口,打造方便研究人员上手的LLM-IFT研究平台。同时tabular_llm分支构建了面向表格智能任务的LLM。
ericshijian/alpaca_chinese_dataset
人工精调的中文对话数据集和一段chatglm的微调代码
ericshijian/awesome-deep-vision
A curated list of deep learning resources for computer vision
ericshijian/Bangalore_Sentiment
Various NLP methods (in python) to perform sentiment analysis
ericshijian/Canova
General Vectorization Lib for Machine Learning Tools
ericshijian/darknet
darknet深度学习框架源码分析:详细中文注释,涵盖框架原理与实现语法分析
ericshijian/deep-pwning
Metasploit for machine learning.
ericshijian/developer-roadmap
Roadmap to becoming a web developer in 2017
ericshijian/doc2vec_pymongo
Machine learning prediction of movies genres using Gensim's Doc2Vec and PyMongo - (Python, MongoDB)
ericshijian/elasticsearch-definitive-guide
The Definitive Guide to Elasticsearch
ericshijian/emdl
Embedded and mobile deep learning research resources
ericshijian/ericshijian.github.io
ericshijian/gaft
A Genetic Algorithm Framework in Python
ericshijian/HanLP
自然语言处理 中文分词 词性标注 命名实体识别 依存句法分析 关键词提取 自动摘要 短语提取 拼音 简繁转换
ericshijian/hed
code for Holistically-Nested Edge Detection
ericshijian/IntelliJ-IDEA-Tutorial
IntelliJ IDEA 简体中文专题教程
ericshijian/Jcoder
Java Dynamic code or JAR , publish you Api or Schedule in flying
ericshijian/kungfu
Kungfu Master Trading System
ericshijian/label_ellipse
A tool to label ellipse object
ericshijian/librec
LibRec: A Java Library for Recommender Systems, see
ericshijian/medical-data
ericshijian/MTCNN-Tensorflow
Reproduce MTCNN using Tensorflow
ericshijian/NRE
Neural Relation Extraction, including CNN, PCNN, CNN+ATT, PCNN+ATT
ericshijian/pointer-generator
Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks" (Python3)
ericshijian/rcf
Richer Convolutional Features for Edge Detection
ericshijian/tensorflow
Computation using data flow graphs for scalable machine learning
ericshijian/TextSum
Preparing a dataset for TensorFlow text summarization (TextSum) model.
ericshijian/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow