yfhsu's Stars
zhoumingyi/ModelObfuscator
Code for our paper "ModelObfuscator: Obfuscating Model Information to Protect Deployed ML-Based Systems", published at ISSTA'23
kubernetes/sample-controller
Repository for sample controller. Complements sample-apiserver
mmumshad/kubernetes-the-hard-way
Bootstrap Kubernetes the hard way on Vagrant on Local Machine. No scripts.
xitongsys/parquet-go-source
source provider for parquet-go
rapidsai/cudf
cuDF - GPU DataFrame Library
saurfang/spark-knn
k-Nearest Neighbors algorithm on Spark
snorkel-team/snorkel
A system for quickly generating training data with weak supervision
priyankavergadia/google-cloud-4-words
The Google Cloud Developer's Cheat Sheet
jwt-scala/jwt-scala
JWT support for Scala. Bonus extensions for Play, Play JSON, Json4s, Circe, uPickle, Spray and Argonaut
fastforwardlabs/few-shot-text-classification
neubig/lowresource-nlp-bootcamp-2020
The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020
joeddav/zero-shot-demo
intel-analytics/BigDL-2.x
BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray
ageron/handson-ml2
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
intel-analytics/ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, GraphRAG, DeepSpeed, Axolotl, etc.
intel/BigDL-Tutorials
Step-by-step Deep Learning Tutorials on Apache Spark using BigDL
ajitrajasekharan/unsupervised_NER
Self-supervised NER prototype - updated version (69 entity types - 17 broad entity groups). Uses pretrained BERT models with no fine-tuning. State-of-the-art performance on 3 biomedical datasets
JohnSnowLabs/nlu
One line of code for thousands of state-of-the-art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems.
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
JohnSnowLabs/spark-nlp-workshop
Public runnable examples of using John Snow Labs' NLP for Apache Spark.
Sakura-gh/ML-notes
notes about machine learning
JohnSnowLabs/spark-nlp
State of the Art Natural Language Processing
ontonotes/conll-formatted-ontonotes-5.0
This is a CoNLL formatted version of the OntoNotes 5.0 release.
stanfordnlp/CoreNLP
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
microsoft/presidio
Context aware, pluggable and customizable data protection and de-identification SDK for text and images
JohnSnowLabs/spark-nlp-models
Models and Pipelines for the Spark NLP library
aravindpai/How-to-build-own-text-summarizer-using-deep-learning
In this notebook, we build an abstractive text summarizer from scratch using deep learning in Python with Keras
doccano/doccano
Open source annotation tool for machine learning practitioners.