Short note on Data discretization.

- December 27, 2021

Data discretization transforms numeric data by mapping values to interval or concept labels. Such methods can be used to automatically generate concept hierarchies for the data, which allows for mining at multiple levels of granularity. Discretization techniques include binning, histogram analysis, cluster analysis, decision tree analysis, and correlation analysis. For nominal data, concept hierarchies may be generated based on schema definitions as well as the number of distinct values per attribute.
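Binning is the simplest of these techniques to see in code. The sketch below is only an illustration, not a reference implementation: the function names `equal_width_bins` and `equal_frequency_bins`, the sample `ages` list, and the concept labels are all assumptions made for this example.

```python
def equal_width_bins(values, k):
    """Assign each value to one of k equal-width intervals (labels 0..k-1)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values identical: everything falls into a single bin.
        return [0] * len(values)
    width = (hi - lo) / k
    # The maximum value would compute to bin k, so clamp it into the last bin.
    return [min(int((v - lo) / width), k - 1) for v in values]


def equal_frequency_bins(values, k):
    """Assign labels 0..k-1 so each bin holds roughly the same number of values.

    Simplification: tied values may end up in different bins.
    """
    order = sorted(range(len(values)), key=lambda i: values[i])
    labels = [0] * len(values)
    for rank, i in enumerate(order):
        labels[i] = rank * k // len(values)
    return labels


# Hypothetical sample data and concept labels, chosen only for illustration.
ages = [22, 23, 24, 25, 31, 45, 52, 70]
concepts = ["young", "middle_aged", "senior"]

width_labels = equal_width_bins(ages, 3)
freq_labels = equal_frequency_bins(ages, 3)

# Replace interval indices with concept labels.
width_concepts = [concepts[b] for b in width_labels]

print(width_labels)    # [0, 0, 0, 0, 0, 1, 1, 2]
print(freq_labels)     # [0, 0, 0, 1, 1, 1, 2, 2]
print(width_concepts)
```

On skewed data like this, equal-width binning crowds most values into the first interval, while equal-frequency binning spreads them evenly across the three bins. Applying either function again to the values inside one bin gives a finer level of the hierarchy.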

Although numerous data preprocessing methods have been developed, data preprocessing remains an active area of research because of the sheer volume of inconsistent or dirty data and the complexity of the problem.
