Issues
- 0
error with data-formatter
#79 opened by tiblop - 3
use LabelEncoder on y values
#74 opened by ClimbsRocks - 2
- 12
error on data-formatter, to lower function
#77 opened by MelvinDunn - 0
long-term nlp column cleanup
#76 opened by ClimbsRocks - 0
update readme for NLP
#75 opened by ClimbsRocks - 0
delete temporary files
#73 opened by ClimbsRocks - 0
- 0
- 0
add nlp to featureEngineering
#70 opened by ClimbsRocks - 0
update validation.py to allow NLP
#69 opened by ClimbsRocks - 0
update API for validationSplit column
#68 opened by ClimbsRocks - 0
more obvious input validation logs
#67 opened by ClimbsRocks - 0
- 1
code clean up
#65 opened by ClimbsRocks - 1
- 0
rename kagglePredict to just predict
#63 opened by ClimbsRocks - 0
- 0
- 0
make a module in featureSelecting available widely for easy feature pruning
#60 opened by ClimbsRocks - 0
FUTURE: add in a subset of new features, prune out the non-useful ones, repeat
#59 opened by ClimbsRocks - 0
add in option to keep all features
#58 opened by ClimbsRocks - 1
- 1
support groupBy for joined data
#55 opened by ClimbsRocks - 1
- 1
update validation.py to handle "date" as an acceptable dataDescription type
#50 opened by ClimbsRocks - 1
bug- joined values
#54 opened by ClimbsRocks - 0
subtract all dates from each other
#53 opened by ClimbsRocks - 0
give the user the ability to run a script doing their own custom data formatting
#52 opened by ClimbsRocks - 0
- 1
handle the validation split in df
#49 opened by ClimbsRocks - 1
write all files as sparse matrices
#45 opened by ClimbsRocks - 0
rossman appears to blow up the fileNames test, even though it works on a very large dataset when run outside the test suite
#48 opened by ClimbsRocks - 0
adds fileNames test to all our test datasets
#47 opened by ClimbsRocks - 0
- 0
offer a don't prune option
#39 opened by ClimbsRocks - 0
- 1
- 1
add tests for joining datasets
#41 opened by ClimbsRocks - 1
- 0
remove first row
#43 opened by ClimbsRocks - 1
handle having no ID for input data
#35 opened by ClimbsRocks - 0
switch back to using rfecv
#38 opened by ClimbsRocks - 0
more feature engineering
#37 opened by ClimbsRocks - 0
look into taking the log of the output column
#36 opened by ClimbsRocks - 0
handle dates
#34 opened by ClimbsRocks - 1
- 1
reformat for messageParent
#31 opened by ClimbsRocks - 0
vectorize output data when relevant
#33 opened by ClimbsRocks - 0
get brain.js formatting back up and running
#32 opened by ClimbsRocks