Title: SEPARATION IN DATA MINING BASED ON FRACTAL NATURE OF DATA
Issue Number: | Vol. 3, No. 1 |
Year of Publication: | 2013 |
Page Numbers: | 50-66 |
Authors: | Marcel Jirina, Marcel Jirina Jr. |
Journal Name: | International Journal of Digital Information and Wireless Communications (IJDIWC) - Hong Kong |
Abstract:
The separation of the searched data from the rest is an important task in data mining. Three separation/classification methods are presented. We use a singularity exponent in classifiers that are based on distances of patterns to a given (classified) pattern. The approximation of so called probability distribution mapping function of the distribution of points from the viewpoint of distances from a given point in the form of a scaling exponent power of a distance is presented together with a way how to state it. Considering data as points in a metric space, three methods are based on transformed distances of neighbors of a given point in a multidimensional space via functions that use different estimates of scaling exponent. Classifiers – data separators utilizing knowledge about explored data distribution in a space and suggested expressions of the scaling exponent are presented. Experimental results on both synthetic and real-life data show interesting behavior (classification accuracy) of classifiers in comparison with other well-known approaches.