semi-structured-data
There are 26 repositories under semi-structured-data topic.
snap-stanford/stark
STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases (NeurIPS D&B 2024)
VorTECHsa/refinery
Refinery is a tool to extract and transform semi-structured data from Excel spreadsheets of different layouts in a declarative way.
BartJongejan/Bracmat
Programming language for symbolic computation with unusual combination of pattern matching features: Tree patterns, associative patterns and expressions embedded in patterns.
utahnlp/infotabs-code
Implementation of the semi-structured inference model in our ACL 2020 paper, INFOTABS: Inference on Tables as Semi-structured Data.
utahnlp/knowledge_infotabs
Repository containing code for the NAACL 2021 paper (Incorporating External Knowledge to Enhance Tabular Reasoning)
rub-ksv/MyFixit-Dataset
A dataset for extracting information from repair manuals
ropensci/EndoMineR
Endoscopic and Pathological data extraction for various endo-pathological data extraction
mansakondo/activemodel-embedding
An ActiveModel extension to model your semi-structured data using embedded associations
cyk1337/UrbanDict
Urban Dict spelling variant dataset. Source code of How to Evaluate Word Representations of Informal Domain?
Dibyakanti/AutoTNLI-code
This repository contains the official code for the paper : Realistic Data Augmentation Framework for Enhancing Tabular Reasoning (Findings-EMNLP, 2022).
lucaliechti/FCAInference
Schema inference for semistructured data using Formal Concept Analysis
taehyounpark/queryosity
Coherent data analysis library
rub-ksv/MyFixit-Annotator
A semi-automatic web-based annotation tool for MyFixit dataset :
Info-Sync/InfoSync
Implementation of the semi-structured inference model in our ACL 2023 paper: INFOSYNC: Information Synchronization across Multilingual Semi-structured Tables.
kuhumcst/texton-Java
Web-based workflow management system that computes candidate tool workflows given input file(s) and the user's requirements regarding the output. Afterwards, runs a workflow selected by the user from the list of candidates. Implemented in Bracmat (~75%) and Java (~25%).
patrikken/PrefTwig2Stack
Java Standalone application for querying XML documents with requests with preferences (GTPs requests with preferences)
sebastiz/EndoMineR
Endoscopic and Pathological data extraction for various endo-pathological data extraction
ngmy/eloquent-serialized-lob
Eloquent Serialized LOB is a trait for Laravel Eloquent models that allows Serialized LOB pattern
RomualdRousseau/Archery
Framework to manipulate semi structured documents and extract data from them
ecemuzun/Database-Manipulation-and-RShiny
Report of a project concerning database construction, management and manipulation that uses various .xml and .csv files from open sources with semi-structured and unstructured data. The analysis is visualised by RShiny dashboard.
meaghancoconnor/prerequiste_checks
A python program which parses student transcript data to determine eligibility
dimitrisrod/masterThesis
MPS Search Engine
MaimoonaKhilji/Hive-Queries
Hive queries
ricardoariasalazar/Tweets-Parsing
Extract data from semi-structured text files and transform the data into XML format.
RomualdRousseau/PyAny2Json
Python binding of Any2Json