Pinned Repositories
spark
Apache Spark - A unified analytics engine for large-scale data processing
_TFM
DB-GPT
Revolutionizing Database Interactions with Private LLM Technology
grpc
The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
grpc-go
The Go language implementation of gRPC. HTTP/2 based RPC
jina
Jina is the cloud-native neural search framework powered by state-of-the-art AI and deep learning
LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
metric-learning
mmlspark
Microsoft Machine Learning for Apache Spark
multimodal-learning
JoanFM's Repositories
JoanFM/DB-GPT
Revolutionizing Database Interactions with Private LLM Technology
JoanFM/langchain
⚡ Building applications with LLMs through composability ⚡
JoanFM/bm25s
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
JoanFM/candle
Minimalist ML framework for Rust
JoanFM/canopy
Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone
JoanFM/chroma
the AI-native open-source embedding database
JoanFM/CLIP_benchmark
CLIP-like model evaluation
JoanFM/ColBERT
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
JoanFM/docs
JoanFM/edenai-apis
Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
JoanFM/elasticsearch
Free and Open Source, Distributed, RESTful Search Engine
JoanFM/GCL
Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained contrastive learning framework.
JoanFM/haystack
:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
JoanFM/haystack-core-integrations
Additional packages (components, document stores and the likes) to extend the capabilities of Haystack version 2.0 and onwards
JoanFM/haystack-integrations
🚀 A list of Haystack Integrations, maintained by the community or deepset.
JoanFM/lancedb
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
JoanFM/limbo
Limbo is a work-in-progress, in-process OLTP database management system, compatible with SQLite.
JoanFM/llama.cpp
LLM inference in C/C++
JoanFM/llama_index
LlamaIndex (formerly GPT Index) is a data framework for your LLM applications
JoanFM/milvus-model
The embedding/reranking model zoo help user to convert their unstructured data into embeedings
JoanFM/mteb
MTEB: Massive Text Embedding Benchmark
JoanFM/ollama
Get up and running with Llama 3, Mistral, Gemma, and other large language models.
JoanFM/pinecone-text
Pinecone text client library
JoanFM/ragstack-astradb
A reusable leave-behind for enterprise customers showing the differentiator of using the best Vector Store in the world: Astra DB
JoanFM/sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
JoanFM/superduperdb
🔮 SuperDuperDB: Bring AI to your database: Integrate, train and manage any AI models and APIs directly with your database and your data.
JoanFM/tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
JoanFM/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
JoanFM/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
JoanFM/weaviate
Weaviate is an open source vector database that stores both objects and vectors, allowing for combining vector search with structured filtering with the fault-tolerance and scalability of a cloud-native database, all accessible through GraphQL, REST, and various language clients.