Pinned Repositories
-----
U-net Mask RCNN
adversarial
Code and hyperparameters for the paper "Generative Adversarial Networks"
Adversarial_Autoencoder
A wizard's guide to Adversarial Autoencoders
arxiv_paper_assistant
Search academic papers from arXiv and other free academic sources, and use a free LLM API to extract one-sentence summaries
chinese_science_paper_to_text
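Reads PDF files from a multi-level directory tree (typically downloaded by a crawler) and extracts their abstracts and body text, so the text you want can be pulled out quickly.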
EDA-Easier-Data-Augment-for-chinese
NLP_DATA_AUGMENT
NLP_Datasets
Chinese NLP datasets
numpy_logistics_softmax_textclassification
Data from the Kaggle movie review classification competition; logistic and softmax classification implemented in NumPy.
pytorch_med_T5-large_scale_pretraining_and_fientune-
Training, validation, and testing of large-scale pretrained medical NLP models based on T5 and mT5
stopwords
Common Chinese stopword lists (HIT stopword list, Baidu stopword list, etc.)
flyingwaters's Repositories
-----
flyingwaters/chinese_science_paper_to_text
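Reads PDF files from a multi-level directory tree (typically downloaded by a crawler) and extracts their abstracts and body text, so the text you want can be pulled out quickly.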
flyingwaters/pytorch_med_T5-large_scale_pretraining_and_fientune-
Training, validation, and testing of large-scale pretrained medical NLP models based on T5 and mT5
flyingwaters/numpy_logistics_softmax_textclassification
Data from the Kaggle movie review classification competition; logistic and softmax classification implemented in NumPy.
flyingwaters/arxiv_paper_assistant
Search academic papers from arXiv and other free academic sources, and use a free LLM API to extract one-sentence summaries
flyingwaters/AI-Collection
A collection of free ChatGPT mirrors available in China, alternatives, prompts, other AI applications, etc.
flyingwaters/awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models
flyingwaters/BERT-NER
Pytorch-Named-Entity-Recognition-with-BERT
flyingwaters/bigraph
Bipartite-network link prediction in Python
flyingwaters/Chinese-Vertical-landing-MoE
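A vertical-industry Chinese-mixtral-8*7b large model: starting from the vocabulary-extended Chinese-mixtral-8*7b, the vocabulary is extended further for a vertical industry, followed by pretraining and instruction fine-tuning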
flyingwaters/CR-Walker
Conversational Recommender System with Tree-structured Graph Reasoning and Dialog Acts
flyingwaters/deduplicate-text-datasets_in_small_memory
Based on deduplicate-text-datasets; adds a Python interface and a batch suffix feature to get around memory limits, making it usable on small-memory machines and helping avoid misuse
flyingwaters/DeepIE
DeepIE: Deep Learning for Information Extraction
flyingwaters/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
flyingwaters/dice_loss_for_NLP
The repo contains the code of the ACL2020 paper `Dice Loss for Data-imbalanced NLP Tasks`
flyingwaters/Diffusion-Models-in-MedIA
A curated list of diffusion models in medical image analysis.
flyingwaters/flyingwaters.github.io
blog & blog theme🤘
flyingwaters/GNN_note
Notes on graph neural networks
flyingwaters/JioNLP
A Chinese NLP preprocessing and parsing package: accurate, efficient, and easy to use. www.jionlp.com
flyingwaters/Matchings_improvement
A data quality improvement framework for schema matchings, based on an LLM service
flyingwaters/ML-and-DL
self-learning
flyingwaters/NER_common
A general framework for domain-specific NER: domain data transfer, NER training, and multi-fold ensembling
flyingwaters/nlp_paper_study
Studying papers from top conferences and reproducing their code
flyingwaters/Ontobuilder-Research-Environment
flyingwaters/photo_based_streamlit
streamlit app
flyingwaters/practice_python
Learning projects for beginners
flyingwaters/prompt-matcher-for-schema-matching
This is a project for schema matching based on prompting GPT-4 or other LLMs. We achieve SOTA results on two schema matching benchmarks.
flyingwaters/sklearn-scripts
Usage examples for the scikit-learn framework: data preprocessing and using some of its models
flyingwaters/t-zero
Reproduce results and replicate training of T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
flyingwaters/tcr_GPT
gpt
flyingwaters/uie_pytorch
A PyTorch implementation of the PaddleNLP UIE model