Pinned Repositories
aircraft_classifier
This repo is the source of the web app built on the cloud platoform Heroku at:
algorithms_data_structures
Algorithms & Data Structures in Python
asymmetric_loss
Implement a custom asymmetric loss to train and drive a regression model towards underestimation or overestimation
interpretable_machine_learning
Getting explanations for predictions made by black box models.
kafka_stream_air_quality
Retrieving the air quality index of various cities through an HTTP API and generating a Kafka stream.
llm_app
My own LLM server.
nlp_hugging_face
Text classification with the transformers library from Hugging Face, by fine-tuning DistilBERT or using summarization + Zero-Shot classification.
stocks_backend
FastAPI backend to fetch stocks data
unlimited-translation_docker_swarm
A translation service (German to English) with no character limit.
weighted_random_forest_spark_3
Experiment with sample weights support added to random forest and a few other classifiers in Spark 3
datatrigger's Repositories
datatrigger/asymmetric_loss
Implement a custom asymmetric loss to train and drive a regression model towards underestimation or overestimation
datatrigger/interpretable_machine_learning
Getting explanations for predictions made by black box models.
datatrigger/nlp_hugging_face
Text classification with the transformers library from Hugging Face, by fine-tuning DistilBERT or using summarization + Zero-Shot classification.
datatrigger/aircraft_classifier
This repo is the source of the web app built on the cloud platoform Heroku at:
datatrigger/algorithms_data_structures
Algorithms & Data Structures in Python
datatrigger/kafka_stream_air_quality
Retrieving the air quality index of various cities through an HTTP API and generating a Kafka stream.
datatrigger/llm_app
My own LLM server.
datatrigger/shiny_apps
Run the CLT Shiny app : https://datatrigger.shinyapps.io/CLT_Visualization/
datatrigger/datatrigger
A blog about software engineering.
datatrigger/unlimited-translation_kubernetes
Deploying our multi-container app unlimited-translation to a GKE cluster
datatrigger/stocks_backend
FastAPI backend to fetch stocks data
datatrigger/stocks_frontend
Frontend of stocks single-page application stocks.vlgdata.io
datatrigger/unlimited-translation_docker_swarm
A translation service (German to English) with no character limit.
datatrigger/unlimited_translation-backend
Source of the Docker image for the API backend of the unlimited-translation app
datatrigger/unlimited_translation-frontend-k8s
Source code of a translation app's Flask frontend microservice
datatrigger/weighted_random_forest_spark_3
Experiment with sample weights support added to random forest and a few other classifiers in Spark 3
datatrigger/anomaly_detection_lof
Anomaly detection using Local Outlier Factor / Isolation Forest with Python / R
datatrigger/beautifulhugo-hydronium
Theme for the Hugo static website generator
datatrigger/computer_vision
Computer vision projects with PyTorch and fast.ai.
datatrigger/light_gradient_boosted_machine
Experiment with with Microsoft's LightGBM to predict the critical temperature of superconductors.
datatrigger/scaling
How to properly split and scale a dataset using Python, Spark & R modules.
datatrigger/school_projects
Here are some projects I did during my master of science in statistics and data science at ISUP, Paris.
datatrigger/snippets
Snippets of code frequently used
datatrigger/subtotals
Adding totals and subtotals rows with pandas / the tidyverse
datatrigger/sum_random_variables
Contour plots to answer this question: if the sum of two random variables is large, are they likely to be both large ?
datatrigger/unlimited_translation-frontend-swarm
Source code of the Docker image for the unlimited-translation app frontend
datatrigger/wiremock-spring-boot
WireMock Spring Boot drastically simplifies testing HTTP clients in Spring Boot & Junit 5 based integration tests.