- LIGO-Feature-Stat is an application to measure the importance in classifying of features from LIGO detecters.
- The idea to measure the importance of features here is as follows. For a given testing point, we find the nearest point to it on the separating surface. Then we compute the margin on each feature between these two points and let the feature with biggest margin be the most important classifying feature for this individual testing point. Then for a set of many testing points, we could measure the importance of a feature by the times of that be the most important classifying feature for a individual testing point. Then we could get a histogram for all the features and find something.
- Get S6_100ms_set_0_train.dat.scale as testing dataset.
- Get S6_100ms_full_train.dat.model to find the points on the separating suiface.
- Compile: $ g++ LFS.c svm.cpp -o LFS
- Run: $ ./LFS S6_100ms_full_train.dat.model S6_100ms_set_0_train.dat.scale
We could present some of the most important features here(the number of tesing set here is 5000):
index
Feature_Count
824
414
724
405
559
394
709
346
644
222
774
196
642
171
584
157
579
138
934
124
Obviously, The most important features are from the 4th feature of detecters.
For most features, their count is zero. From the histogram, We could see a few pulses on it.