cplouffesensibill's Stars
eugeneyan/applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
AI4Finance-Foundation/FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
scala/scala
Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3
aws/amazon-sagemaker-examples
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
dbt-labs/dbt-core
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
python-attrs/attrs
Python Classes Without Boilerplate
bats-core/bats-core
Bash Automated Testing System
subframe7536/maple-font
[Try V7!] Maple Mono: Open source monospace font with round corner, ligatures and Nerd-Font for IDE and command line. 带连字和控制台图标的圆角等宽字体,中英文宽度完美2:1
princeton-nlp/tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
scalaz/scalaz
Principled Functional Programming in Scala
amundsen-io/amundsen
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
awslabs/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
aws/deep-learning-containers
AWS Deep Learning Containers are pre-built Docker images that make it easier to run popular deep learning frameworks and tools on AWS.
dgrtwo/fuzzyjoin
Join tables together on inexact matching
ran-isenberg/aws-lambda-handler-cookbook
This repository provides a working, deployable, open source-based, serverless service blueprint with an AWS Lambda function and AWS CDK Python code with all the best practices and a complete CI/CD pipeline.
tensorflow/lattice
Lattice methods in TensorFlow
gjbae1212/gossm
💻Interactive CLI tool that you can connect to ec2 using commands same as start-session, ssh in AWS SSM Session Manager
gocardless/airflow-dbt
Apache Airflow integration for dbt
awslabs/kinesis-kafka-connector
kinesis-kafka-connector is connector based on Kafka Connect to publish messages to Amazon Kinesis streams or Amazon Kinesis Firehose.
voila-dashboards/voila-vuetify
Dashboard template for Voilà based on VuetifyJS
aws-samples/amazon-sagemaker-ground-truth-task-uis
Example task UIs for Amazon SageMaker Ground Truth
transferwise/pipelinewise-target-snowflake
Singer.io Target for Snowflake - PipelineWise compatible
nicor88/dbt-serverless
Run dbt serverless in the Cloud (AWS)
DyfanJones/RAthena
Connect R to Athena using Boto3 SDK (DBI Interface)
nathangiusti/PySense
Python SDK for Sisense
carter-kilgour/delta-quality-testing
Lighting Talk Data & AI Summit Europe 2020 - Data Quality Testing in the Medallion Architecture