numpy_k_mean

A k-means implementation in Python using only NumPy and linear algebra. It is multivariate and supports an arbitrary k.

T: number of iterations
plotting is supported for up to 6 groups
emphasis on vectorized computation
each empty group is initialized with the sample farthest from the average of the samples
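
The points above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the function name `k_means`, the `seed` parameter, the random choice of initial centers, and the use of successive farthest samples when several groups are empty at once are all assumptions made for the example.

```python
import numpy as np

def k_means(samples, k, T=10, seed=0):
    """Sketch of vectorized k-means. samples: (n, d) array.

    Returns (centers, labels) with shapes (k, d) and (n,).
    """
    rng = np.random.default_rng(seed)
    n = samples.shape[0]
    # Assumption: start from k randomly chosen samples as centers.
    centers = samples[rng.choice(n, size=k, replace=False)].astype(float)

    # Sample indices ordered from farthest to nearest to the overall average.
    mean = samples.mean(axis=0)
    far_order = np.argsort(-np.linalg.norm(samples - mean, axis=1))

    for _ in range(T):
        # Broadcasting: (n, 1, d) - (1, k, d) -> (n, k, d), then norm over d.
        dists = np.linalg.norm(samples[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)

        n_empty = 0
        for j in range(k):
            members = samples[labels == j]
            if len(members) > 0:
                centers[j] = members.mean(axis=0)
            else:
                # Empty group: take a sample far from the average of samples.
                # Assumption: the i-th empty group gets the i-th farthest one.
                centers[j] = samples[far_order[n_empty]]
                n_empty += 1
    # labels come from the last assignment step, before the final center update.
    return centers, labels
```

A call such as `centers, labels = k_means(data, k=3, T=20)` would then return one center per group and one group index per sample. The only Python loop over data is the short loop over the k groups; the distance computation itself is a single broadcast operation.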

to be done:

fix the double handling of empty groups
measure run times and compare
handle unusual data types without errors, and handle NaN values
normalize sample weights
create an option for a hypothesis
create a filtered input (remove outliers)
PCA: remove redundant groups

first remark:

A question about Python (NumPy):

Which is more costly?

Reshaping Centers and Samples each time? That is, reshaping again in every iteration, so the reshape is recomputed each time.

Or reshaping Centers and Samples only once, and then accessing the 3D arrays via Cents[0,:,k] or Samples[guess == k,:,0]?
pro: reshaping only once
con: need to index the arrays using ":" and i=0, etc.
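
One way to approach the question is to time both variants. The sketch below is only an illustration: the array sizes, the function names, and the squared-distance computation are assumptions, not code from this repository. Note that `reshape` on a contiguous NumPy array normally returns a view rather than a copy, so the per-iteration cost is likely a small constant overhead instead of something that grows with the data, but measured times depend on the machine and the array sizes.

```python
import timeit
import numpy as np

rng = np.random.default_rng(0)
samples = rng.normal(size=(1000, 3))   # (n, d), hypothetical sizes
centers = rng.normal(size=(4, 3))      # (k, d)
n, d = samples.shape
k = centers.shape[0]

def reshape_each_time():
    # Variant 1: reshape inside every call (i.e. every iteration).
    s3 = samples.reshape(n, d, 1)
    c3 = centers.T.reshape(1, d, k)
    return ((s3 - c3) ** 2).sum(axis=1)   # (n, k) squared distances

# Variant 2: reshape once, keep the 3D arrays around.
s3_once = samples.reshape(n, d, 1)
c3_once = centers.T.reshape(1, d, k)

def reshape_once():
    return ((s3_once - c3_once) ** 2).sum(axis=1)

# The reshaped array shares memory with the original: no data was copied.
print("view of samples:", np.shares_memory(samples, s3_once))

t_each = timeit.timeit(reshape_each_time, number=200)
t_once = timeit.timeit(reshape_once, number=200)
print(f"reshape each time: {t_each:.4f} s, reshape once: {t_once:.4f} s")
```

If the two timings turn out close, the choice can be made on readability alone: reshaping once costs the extra ":" and 0 indices, while reshaping each iteration keeps the 2D arrays simple to index.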
