Structure of the data
NurElHuda opened this issue · 0 comments
NurElHuda commented
Discussed in #39
Originally posted by NurElHuda January 9, 2022
@ZibanPirate @Fcmam5
I was wondering why the structure of the data is set as a tree of directories with info.json leaf instead of one main JSON file that contains all data.
- The overhead of traversing a JSON object compared to opening nested directories files to extract data is negligible.
- The name of the directories of the department does not make sense, the acronyms for the faculties are set by the universities and somewhat distinguishable, but once you dive into the departments you get lost.
- What is more important, is search. I can't think of an efficient way to search for, let's say, all the departments of geology amidst all the data.
- The moment the data is stored into one main JSON file will make automating, scraping, and extracting the data from the internet much faster.
So.. what do you guys say? I am happy to convert the available data into a JSON file, than continue with the rest.
Or is there other reasons for why the data is stored like this that I am missing?