/WSM2020_proj

Code for WSM 2020 Project

Primary LanguageCSSMIT LicenseMIT

WSM2020_proj

Code for WSM 2020 Project: Chinese WestLaw system

demo

Requirements

  • Python3
  • numpy
  • Django 3.0.6 (For web interface)
  • pkuseg
  • nltk
  • fuzzywuzzy

Data Preparation

Download the original data from the course website.

Data1 & Data2

  1. Put the scripts utils/index_for_data1.py and utils/index_for_data2.py into the data folder, on the same position with zxgk/ for data1, info/ for data2.

  2. Remove the duplicated files, rename the .json files with successive integer and build inverted index:

     python index_for_data1.py
     python index_for_data2.py
    

Instrument

  1. Build inverted index and tf-idf dictionary.

  2. Put the generated indexes and original data into folder /data1/ and /data2/.

We also provide the processed data (extraction code: izcl).

Usage

  1. Change to the root of the project, run python manage.py runserver [port] to start the server.

  2. Search legal case records with boolean search, we also provide fuzzy search for wrongly inputs. boolean

  3. Query instruments with a sentence. query sentence

  4. Sort the results according to your needs. sort

Contributor