/chategw

Data preparation

Dotnet data preparation

$ cd src/ChatEgw.UI.Indexer
$ export EGW_SEARCH_DSN="Server=localhost;Database=search;Port=15432;Username=postgres;Password=password"
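
The DSN above is a semicolon-separated list of `Key=Value` pairs. As a rough sanity check before running the indexer, a small Python sketch like the following could confirm that the variable is set and contains the keys used in the example. The helper name `parse_dsn` is hypothetical and not part of this repository, and the sketch assumes plain values with no quoting or embedded semicolons.

```python
import os


def parse_dsn(dsn: str) -> dict[str, str]:
    """Split a 'Key=Value;Key=Value' string into a dict with lowercase keys.

    Assumption: values contain no quoting and no embedded semicolons.
    """
    pairs = {}
    for part in dsn.split(";"):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing semicolon or an empty string
        key, sep, value = part.partition("=")
        if not sep:
            raise ValueError(f"malformed DSN segment: {part!r}")
        pairs[key.strip().lower()] = value.strip()
    return pairs


if __name__ == "__main__":
    fields = parse_dsn(os.environ.get("EGW_SEARCH_DSN", ""))
    # Keys taken from the example DSN above; adjust if yours differs.
    expected = {"server", "database", "port", "username", "password"}
    missing = expected - fields.keys()
    print("missing keys:", sorted(missing) if missing else "none")
```

Lowercasing the keys keeps the check independent of how the variable was typed.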

Create database

$ dotnet run -- migrate

Import base data

$ dotnet run -- import egw -f "Host=localhost;Username=user;Password=password;Database=database"

Export data to a file for Python postprocessing

$ dotnet run -- export tsv paragraphs-raw.tsv
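
On the Python side, the exported file can be inspected with the standard `csv` module before any further processing. The sketch below makes no assumption about the column layout of `paragraphs-raw.tsv`. It only assumes tab-delimited UTF-8 text without quote escaping, and the function names are hypothetical.

```python
import csv
from collections import Counter


def read_tsv_rows(path: str):
    """Yield each line of a tab-separated file as a list of string fields."""
    with open(path, newline="", encoding="utf-8") as handle:
        # QUOTE_NONE treats quote characters as ordinary text (an assumption
        # about the export format, not something the README states).
        reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        for row in reader:
            yield row


def column_count_histogram(path: str) -> Counter:
    """Count how many rows have each number of fields, to spot ragged lines."""
    return Counter(len(row) for row in read_tsv_rows(path))


if __name__ == "__main__":
    print(column_count_histogram("paragraphs-raw.tsv"))
```

If the histogram shows more than one field count, some paragraphs probably contain raw tabs or line breaks and need cleaning before tagging.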

Extract tagging from raw data

$ cd cuda-backend