Pinned Repositories
-
Matrix calculus (matrix derivatives), reposted from https://zhuanlan.zhihu.com/p/25063314
2kenize
Upcoming ACL 2020 paper
ACE_Chinese_2005
Preprocessing for the relation extraction task
acl2018-semantic-parsing-tutorial
Materials from the ACL 2018 tutorial on neural semantic parsing
acl2019_nested_ner
Source code for the paper "Neural Architectures for Nested NER through Linearization"
IJCAI-17
MoChA-pytorch
PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)
Paper_Writing_Tips
PySimpleGUI-YOLO
A YOLO Artificial Intelligence algorithm demonstration using PySimpleGUI
wavelets
Python implementation of the wavelet analysis found in Torrence and Compo (1998)
sidney1994's Repositories
sidney1994/Paper_Writing_Tips
sidney1994/Awesome-Multi-label-Image-Recognition
Awesome Multi-label Image Recognition Paper List
sidney1994/Classical-Modern
A very comprehensive parallel corpus of Classical Chinese (ancient texts) and modern Chinese
sidney1994/clearml
ClearML - Auto-Magical CI/CD to streamline your ML workflow. Experiment Manager, MLOps and Data-Management
sidney1994/cmu_multilingual_speech
CMU multilingual speech repository
sidney1994/CNERTA
sidney1994/composer
library of speed-up algorithms for model training
sidney1994/dataset_difficulty
"Understanding Dataset Difficulty with V-Usable Information" (ICML 2022, outstanding paper)
sidney1994/extend
Entity Disambiguation as text extraction (ACL 2022)
sidney1994/facestar
Facestar dataset. High quality audio-visual recordings of human conversational speech.
sidney1994/famie
FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction
sidney1994/FREDA
Fast and Flexible Data Annotation for Relation Extraction
sidney1994/huggingsound
HuggingSound: A toolkit for speech-related tasks based on HuggingFace's tools
sidney1994/IndicLink
IndicLink is a Multilingual Fact Linking (MFL) dataset of sentences and a set of WikiData facts (subject; relation; object) contained in each sentence. IndicLink contains sentences from English and 6 Indian languages - Hindi, Telugu, Tamil, Urdu, Gujarati and Assamese. The correct facts are chosen from an oracle of 4.7 million Wikidata facts with fact labels/descriptions available in these 7 languages. The dataset is intended only to act as a test set to evaluate models trained for the task of MFL. For more details, please see https://arxiv.org/abs/2109.14364
sidney1994/lab-website-template
(Pre-release) An easy-to-use, flexible website template for labs, with automatic citations, GitHub tag imports, pre-built components, and more
sidney1994/lightning-hydra-template
PyTorch Lightning + Hydra. A very user-friendly template for rapid and reproducible ML experimentation with best practices. ⚡🔥⚡
sidney1994/lingfeat
LingFeat - A Comprehensive Linguistic Features Extraction ToolKit for Readability Assessment
sidney1994/NeuralKG
sidney1994/NS-Dial
An Interpretable Neuro-Symbolic Framework for Task-Oriented Dialogue Generation
sidney1994/polytropon
sidney1994/python_plot_utils
Simple code for plotting figures, adding colorbars, and cropping with Python
sidney1994/SELFRec
An open-source framework for self-supervised recommender systems.
sidney1994/TempEL
Repository for Temporal Entity Linking (TempEL), accepted to the NeurIPS 2022 Datasets and Benchmarks track
sidney1994/timelms
TimeLMs: Diachronic Language Models from Twitter
sidney1994/tools
Practical tools: writing slides (PPT) in Markdown, automated command-line demo tools, front-end component libraries, and more
sidney1994/torchstudio
sidney1994/txtai
💡 Build AI-powered semantic search applications
sidney1994/video2dataset
Easily create large video datasets from video URLs
sidney1994/wikipedia-utils
Utility scripts for preprocessing Wikipedia texts for NLP
sidney1994/yahp
hyperparameter management