Research Journal of Information Technology
ISSN: 1815-7432, 2151-7959
Academic Journals Inc.
DOI: 10.3923/rjit.2015.112.120
Ali Seman, Azizian Mohd Sapawi and Mohd Zaki Salleh
2015, Volume 7, Issue 2

The effectiveness of κ-Approximate Modal Haplotype (κ-AMH)-type algorithms for clustering Y-short tandem repeat (Y-STR) categorical data has been demonstrated previously. Three newly introduced κ-AMH-type algorithms, the new κ-AMH I (Nκ-AMH I), the new κ-AMH II (Nκ-AMH II) and the new κ-AMH III (Nκ-AMH III), are derived from the same κ-AMH optimization and fuzzy procedures but incorporate two new methods: a new initial center selection method and a new dominant weighting method. This study evaluates the performance of κ-AMH-type algorithms for clustering five categorical data sets, namely soybean, zoo, hepatitis, voting and breast. The performance criteria are accuracy, precision and recall. Overall, the κ-AMH-type algorithms perform well on all five data sets. The Nκ-AMH I algorithm performs best, obtaining the highest combined mean accuracy score (0.9130), compared with κ-AMH (0.8971), Nκ-AMH II (0.8885) and Nκ-AMH III (0.9011). This high score is associated with the newly introduced initial center selection method combined with the original dominant weighting method. These results present a new and significant benchmark, indicating that κ-AMH-type algorithms can be generalized for any categorical data.