Pinned Repositories
annoML
annoML - a open framework for annotating machine learning related visualization
antithesis_detection
Official implementation of "Using Pre-Trained Language Models in an End-to-End Pipeline for Antithesis Detection" accepted in LREC-2024
ir-rag-sigir24-persona-rag
pypadre
In this research project, we aim to create an environment to gather structured data about machine learning experiments in order to analyze data and algorithmich dependencies.
pypads
Building on the MLFlow toolset this project aims to extend the functionality for MLFlow, increase the automation and therefore reduce the workload for the user. The production of structured results is an additional goal of the extension.
pypads-onto
Extension for ontology integrations
pypads-padre
An extension of pypads tracking machine learning workflows (steps and concepts). Most of the concepts are derived from PaDRe.
robotstxt-study
A Longitudinal study of robots.txt files extracted from the Common Crawl web archive
simiir-2
SimIIR 2.0 extends the Python-based SimIIR framework for simulating interactive information retrieval (IIR).
tempweb24-content-control-study
Code for the paper 'A Longitudinal Study of Content Control Mechanisms' presented at the TempWeb workshop (WWW'24)
PaDaS-Lab's Repositories
padas-lab-de/ir-rag-sigir24-persona-rag
padas-lab-de/pypads
Building on the MLFlow toolset this project aims to extend the functionality for MLFlow, increase the automation and therefore reduce the workload for the user. The production of structured results is an additional goal of the extension.
padas-lab-de/simiir-2
SimIIR 2.0 extends the Python-based SimIIR framework for simulating interactive information retrieval (IIR).
padas-lab-de/pypadre
In this research project, we aim to create an environment to gather structured data about machine learning experiments in order to analyze data and algorithmich dependencies.
padas-lab-de/pypads-onto
Extension for ontology integrations
padas-lab-de/pypads-padre
An extension of pypads tracking machine learning workflows (steps and concepts). Most of the concepts are derived from PaDRe.
padas-lab-de/tempweb24-content-control-study
Code for the paper 'A Longitudinal Study of Content Control Mechanisms' presented at the TempWeb workshop (WWW'24)
padas-lab-de/antithesis_detection
Official implementation of "Using Pre-Trained Language Models in an End-to-End Pipeline for Antithesis Detection" accepted in LREC-2024
padas-lab-de/instruct-to-sparql
Instruct-to-SPARQL is a dataset that consists of pairs of Natural language instructions and SPARQL queries. The dataset is created by crawling Wikipedia pages and tutorials for real examples of WikiData SPARQL queries.
padas-lab-de/intra-class-similarity-guided-distillation
Official implementation of "Intra-Class Similarity-Guided Feature Distillation" accepted in NeurIPS-ENLSP 2023
padas-lab-de/krony-PT
Compressing GPT2 using Kronecker products.
padas-lab-de/learn-from-one-specialized-sub-teacher
Official implementation of "Learn From One Specialized Sub-Teacher: One-to-One Mapping for Feature-Based Knowledge Distillation" accepted in EMNLP-Findings 2023
padas-lab-de/memBERT
Source code for the MemBERT paper
padas-lab-de/multi-language-dataset-creator
padas-lab-de/robotstxt-study
A Longitudinal study of robots.txt files extracted from the Common Crawl web archive
padas-lab-de/url-classification
Classify webpages using their URLs
padas-lab-de/url-dataset-crawling
Code to crawl content of a dataset of URLs
padas-lab-de/brwsr
Lightweight Linked Data Browser
padas-lab-de/Can-GPT-4-Replace-Human-Examiners-A-Competition-on-Checking-Open-Text-Answers
Accompanying code for the paper "Can GPT-4 Replace Human Examiners? A Competition on Checking Open-Text Answers"
padas-lab-de/icadl24-agent4dl
Agent4DL is a Python-based simulation framework designed to model user search behavior using agentic large language models (LLMs).
padas-lab-de/mlflow-docker
Production ready docker-compose configuration for ML Flow with Mysql and Minio S3
padas-lab-de/padre-lab-eu.github.io
Web Page for the PAssau Data science REsearch Lab
padas-lab-de/Performance-analysis-of-large-language-models-in-the-domain-of-legal-argument-mining
Accompanying code for the paper "Performance analysis of large language models in the domain of legal argument mining"
padas-lab-de/pypads-examples
Python file examples for PyPads
padas-lab-de/pypads-notebooks
padas-lab-de/pypads_examples
This repository includes a set of examples how to use pypads
padas-lab-de/pypads_jupyter_examples
This repository shows examples how PyPads can be used with a jupyter notebook.
padas-lab-de/tpdl24-comparative-analysis-user-interactions-in-digital-libraries
This study compares EconBiz (private) and SUSS (public) datasets, revealing EconBiz's more detailed user interactions. It highlights the scarcity of public datasets with rich user data in digital libraries and explores using LLMs to simulate detailed interactions while preserving anonymity, aiming to enhance public datasets for improved research.
padas-lab-de/wikibase-docker
🐳 Docker images and example compose file for Wikibase and surrounding services
padas-lab-de/zero-to-jupyterhub-k8s
Helm Chart & Documentation for deploying JupyterHub on Kubernetes