O uso de algoritmos genéticos para a solução de problemas de agrupamento automático

Lima, Suzane Pereira

Please use this identifier to cite or link to this item: https://rima.ufrrj.br/jspui/handle/20.500.14407/14340

Full metadata record

DC Field	Value	Language
dc.contributor.author	Lima, Suzane Pereira
dc.date.accessioned	2023-12-22T03:00:02Z	-
dc.date.available	2023-12-22T03:00:02Z	-
dc.date.issued	2019-07-30
dc.identifier.citation	LIMA, Suzane Pereira. O uso de algoritmos genéticos para a solução de problemas de agrupamento automático. 2019. 92 f. Dissertação (Mestrado em Modelagem Matemática e Computacional) - Instituto de Ciências Exatas, Universidade Federal Rural do Rio de Janeiro, Seropédica, 2019.	por
dc.identifier.uri	https://rima.ufrrj.br/jspui/handle/20.500.14407/14340	-
dc.description.abstract	Técnicas de agrupamento de dados consistem na organização de um conjunto de informações em grupos de acordo com as similaridades presentes em seus registros, assim propriedades comuns entre o conjunto de dados conseguem ser identificadas facilitando a sua compreensão. Nem sempre o número de grupos é um dado disponível a priori para a resolução desse processo. Quando esta informação é desconhecida tem-se o denominado Problema de Agrupamento Automático. Neste trabalho são apresentadas estratégias para a resolução deste problema, tendo como base a meta-heurística Algoritmo Genético. Sabendo que a qualidade de um agrupamento pode ser influenciada pelo modo como são gerados os grupos iniciais e pela escolha da função de avaliação das soluções, diferentes procedimentos são propostos a partir de um método já existente com o objetivo de gerar resoluções de melhor qualidade. Experimentos foram aplicados às propostas para diversos conjuntos de dados. Os resultados obtidos são comparados entre si e com outros trabalhos da literatura.	por
dc.description.sponsorship	CAPES - Coordenação de Aperfeiçoamento de Pessoal de Nível Superior	por
dc.format	application/pdf	*
dc.language	por	por
dc.publisher	Universidade Federal Rural do Rio de Janeiro	por
dc.rights	Acesso Aberto	por
dc.subject	Problema de Agrupamento Automático	por
dc.subject	Heurísticas	por
dc.subject	Otimização	por
dc.subject	Automatic Clustering Problem	eng
dc.subject	Heuristics	eng
dc.subject	Optimization	eng
dc.title	O uso de algoritmos genéticos para a solução de problemas de agrupamento automático	por
dc.type	Dissertação	por
dc.description.abstractOther	The Clustering consists to organize a set of information into groups according to the similarities present in their records, so that common properties between the data can be identified for easy understanding. Sometimes the number of groups is a unavailable data for the resolution of this process. When this information is unknown a priori we have the so-called Automatic Clustering Problem. In this work are presented algorithms to solve this problem, based on the meta-heuristic Genetic Algorithm. The quality of a Clustering can be affected by the way the initial groups are generated and by the choice of the cluster validity index, different methods are proposed based on an already existing methodology with the purpose of generating better quality resolutions. Experiments were applied to proposed approaches for several data sets. The obtained results are compared with each other and with other works of the literature.	eng
dc.contributor.advisor1	Cruz, Marcelo Dib
dc.contributor.advisor1ID	CPF: 016.628.007-03	por
dc.contributor.referee1	Vianna, Gizelle Kupac
dc.contributor.referee2	Tassinari, Wagner de Souza
dc.creator.ID	CPF: 146.054.587-78	por
dc.creator.Lattes	http://lattes.cnpq.br/7971242219042855	por
dc.publisher.country	Brasil	por
dc.publisher.department	Instituto de Ciências Exatas	por
dc.publisher.initials	UFRRJ	por
dc.publisher.program	Programa de Pós-Graduação em Modelagem Matemática e Computacional	por
dc.relation.references	[1] Clustering basic benchmark. Disponível em: http://cs.joensuu.fi/sipu/datasets/; Acessado em 04 de Abril de 2018. [2] Machine learning repository. Disponível em: http://archive.ics.uci.edu/ml/index.php; Acessado em 13 de Julho de 2017. [3] ANARI, B., TORKESTANI, J. A., E RAHMANI, A. Automatic data clustering using continuous action-set learning automata and its application in segmentation of images. Applied Soft Computing 51 (2017), 253–265. [4] ARENALES, M., MORABITO, R., ARMENTANO, V., E YANASSE, H. Pesquisa operacional: Para cursos de engenharia. Elsevier Brasil, 2015. [5] BECKER, J. L. Estatística básica: transformando dados em informação. Bookman Editora, 2015. [6] BERKHIN, P. A survey of clustering data mining techniques. In Grouping multidimensional data. Springer, 2006, pp. 25–71. [7] BRITO, J. D. M., E MONTENEGRO, F. M. T. Aplicações de Técnicas de Pesquisa Operacional em Problemas de Agrupamento do IBGE. População, espaço e sustentabilidade: contribuições para o desenvolvimento do Brasil, 01 2015, pp. 15–33. [8] CALI´NSKI, T., E HARABASZ, J. A dendrite method for cluster analysis. Communications in Statistics-theory and Methods 3, 1 (1974), 1–27. [9] CHARU, C. A., E CHANDAN, K. R. Data clustering: algorithms and applications. Chapman and Hall/CRC Boca Raton, 2013. [10] CHOU, C.-H., SU, M.-C., E LAI, E. A new cluster validity measure and its application to image compression. Pattern Analysis and Applications 7, 2 (2004), 205–220. [11] CRUZ, M. D. O problema de clusterização automática. PhD thesis, COPPE/UFRJ, Rio de Janeiro, 2010. [12] DAS, S., ABRAHAM, A., E KONAR, A. Automatic clustering using an improved differential evolution algorithm. IEEE Transactions on systems, man, and cybernetics-Part A: Systems and Humans 38, 1 (2008), 218–237. [13] DAS, S., CHOWDHURY, A., E ABRAHAM, A. A bacterial evolutionary algorithm for automatic data clustering. In 2009 IEEE Congress on Evolutionary Computation (2009), IEEE, pp. 2403–2410. [14] DAVIES, D. L., E BOULDIN, D. W. A cluster separation measure. IEEE transactions on pattern analysis and machine intelligence, 2 (1979), 224–227. [15] DOVAL, D., MANCORIDIS, S., E MITCHELL, B. S. Automatic clustering of software systems using a genetic algorithm. In STEP’99. Proceedings Ninth International Workshop Software Technology and Engineering Practice (1999), IEEE, pp. 73–81. [16] DUNN, J. C. A fuzzy relative of the isodata process and its use in detecting compact wellseparated clusters. Journal of Cybernetics (1973). [17] FERRARI, D. G., E SILVA, L. N. D. C. Introdução a mineração de dados. Editora Saraiva, 2017. [18] FISHER, R. A. The use of multiple measurements in taxonomic problems. Annals of eugenics 7, 2 (1936), 179–188. [19] GAN, G., MA, C., E WU, J. Data clustering: theory, algorithms, and applications, vol. 20. Siam, 2007. [20] GOLDSCHMIDT, R. R. Uma Introdução à Inteligência Computacional: fundamentos, ferramentas e aplicações. IST-Rio, 2010. [21] GOLDSCHMIDT, R. R. Tópicos Especiais em Inteligência Computacional. IST-Rio, 2011. [22] HANSEN, P., E JAUMARD, B. Cluster analysis and mathematical programming. Mathematical programming 79, 1-3 (1997), 191–215. [23] HARSH, A., E BALL, J. E. Automatic k-expectation maximization (a k-em) algorithm for data mining applications. Journal of Computations & Modelling 6, 3 (2016), 43–85. [24] HE, H., E TAN, Y. A two-stage genetic algorithm for automatic clustering. Neurocomputing 81 (2012), 49–59. [25] HILLIER, F. S., E LIEBERMAN, G. J. Introdução à pesquisa operacional. McGraw Hill Brasil, 2013. [26] JAIN, A. K., DUBES, R. C., E OTHERS. Algorithms for clustering data, vol. 6. Prentice hall Englewood Cliffs, 1988. [27] JAIN, A. K., MURTY, M. N., E FLYNN, P. J. Data clustering: a review. ACM computing surveys (CSUR) 31, 3 (1999), 264–323. [28] JOSÉ-GARCÍA, A., E GÓMEZ-FLORES, W. Automatic clustering using nature-inspired metaheuristics: A survey. Applied Soft Computing 41 (2016), 192–213. [29] KETTANI, O., RAMDANI, F., E TADILI, B. Ak-means: an automatic clustering algorithm based on k-means. Journal of Advanced Computer Science & Technology 4, 2 (2015), 231. [30] KUDOVA, P. Clustering genetic algorithm. In 18th International Workshop on Database and Expert Systems Applications (DEXA 2007) (2007), IEEE, pp. 138–142. [31] KUNDU, D., SURESH, K., GHOSH, S., DAS, S., ABRAHAM, A., E BADR, Y. Automatic clustering using a synergy of genetic algorithm and multi-objective differential evolution. In International Conference on Hybrid Artificial Intelligence Systems (2009), Springer, pp. 177–186. [32] KUO, R., HUANG, Y., LIN, C.-C., WU, Y.-H., E ZULVIA, F. E. Automatic kernel clustering with bee colony optimization algorithm. Information Sciences 283 (2014), 107–122. [33] LINDEN, R. Algoritmos genéticos (2a ediçao). Brasport, 2008. [34] LINDEN, R. Técnicas de agrupamento. Revista de Sistemas de Informação da FSMA 4 (2009), 18–36. [35] LIU, Y., WU, X., E SHEN, Y. Automatic clustering using genetic algorithms. Applied mathematics and computation 218, 4 (2011), 1267–1279. [36] LOESCH, C., E HEIN, N. Pesquisa operacional. Editora Saraiva, 2009. [37] LOESCH, C., E HOELTGEBAUM, M. Métodos estatísticos multivariados. Editora Saraiva, 2017. [38] MARONNA, R., E JACOVKIS, P. M. Multivariate clustering procedures with variable metrics. Biometrics (1974), 499–505. [39] MARTINEZ, J. M., E SANTOS, S. A. Métodos computacionais de otimização. In XX Colóquio Brasileiro de Matemática, IMPA (1995). [40] MAULIK, U., E BANDYOPADHYAY, S. Genetic algorithm-based clustering technique. Pattern recognition 33, 9 (2000), 1455–1465. [41] MAULIK, U., E BANDYOPADHYAY, S. Performance evaluation of some clustering algorithms and validity indices. IEEE Transactions on pattern analysis and machine intelligence 24, 12 (2002), 1650–1654. [42] MISHRA, S., SAHA, S., E MONDAL, S. A multiobjective optimization based entity matching technique for bibliographic databases. Expert Systems with Applications 65 (2016), 100–115. [43] MITCHELL, M. An introduction to genetic algorithms. MIT press, 1998. [44] NANDA, S. J., E PANDA, G. A survey on nature inspired metaheuristic algorithms for partitional clustering. Swarm and Evolutionary computation 16 (2014), 1–18. [45] OCHI, L. S., DIAS, C. R., E SOARES, S. S. F. Clusterização em mineração de dados. Instituto de Computação-Universidade Federal Fluminense-Niterói (2004), 26. [46] PACHECO, T. M., BRUGIOLO, L., STRÖELE, V., E SÃ, S. Metaheurística inspirada no comportamento das formigas aplicada ao problema de agrupamento. In XIII Congresso Brasileiro de Inteligência Computacional (2017). [47] PAL, N. R., E BEZDEK, J. C. On cluster validity for the fuzzy c-means model. IEEE Transactions on Fuzzy systems 3, 3 (1995), 370–379. [48] QUEIROGA, E. V., SUBRAMANIAN, A., E CABRAL, L. D. A. F. Abordagem baseada em continuous grasp para clusterização de dados. In XLVIII SBPO (2016). [49] RAPOSO, C., ANTUNES, C. H., E BARRETO, J. P. Automatic clustering using a genetic algorithm with new solution encoding and operators. In International Conference on Computational Science and Its Applications (2014), Springer, pp. 92–103. [50] REZENDE, S. O., MARCACINI, R. M., E MOURA, M. F. O uso da mineração de textos para extração e organização não supervisionada de conhecimento. Embrapa Informática Agropecuária-Artigo em periódico indexado (ALICE) (2011). [51] RODRIGUES, W. C., FADEL, A. C., SEMAAN, G. S., E BRITO, J. A. D. M. Um novo método baseado em grade e densidade com tratamento de ruídos para a identificação do número ideal de grupos. In XLVIII SBPO (2016). [52] ROUSSEEUW, P. J. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. Journal of computational and applied mathematics 20 (1987), 53–65. [53] RUSPINI, E. H. Numerical methods for fuzzy clustering. Information Sciences 2, 3 (1970), 319–350. [54] SCHWAAB, M. Análise de Dados Experimentais: I. Fundamentos de Estatística e Estimação de Parâmetros. Editora E-papers, 2007. [55] SEMAAN, G. S., CRUZ, M. D., BRITO, G. D. M., E OCHI, L. S. Proposta de um métodode classificação baseado em densidade para a determinação do número ideal de grupos em problemas de clusterização. Journal of the Brazilian Computational Intelligence Society 10, 4 (2012), 242–262. [56] SEMAAN, G. S., FADEL, A. C., DE MOURA BRITO, J. A., E OCHI, L. S. Uma heurística baseada em densidade para o problema de agrupamento automático. Blucher Marine Engineering Proceedings 2, 1 (2016), 100–112. [57] THEODORIDIS, S., E KOUTROUMBAS, K. Pattern recognition. Second Edition. Elsevier, 2003. [58] TSENG, L. Y., E YANG, S. B. A genetic approach to the automatic clustering problem. Pattern recognition 34, 2 (2001), 415–424. [59] VEENMAN, C. J., REINDERS, M. J. T., E BACKER, E. A maximum variance cluster algorithm. IEEE Transactions on pattern analysis and machine intelligence 24, 9 (2002), 1273–1280. [60] WANG, X., QIU, W., E ZAMAR, R. H. Clues: A non-parametric clustering method based on local shrinking. Computational Statistics & Data Analysis 52, 1 (2007), 286–298.	por
dc.subject.cnpq	Ciência da Computação	por
dc.subject.cnpq	Matemática	por
dc.thumbnail.url	https://tede.ufrrj.br/retrieve/67578/2019%20-%20Suzane%20Pereira%20Lima.pdf.jpg	*
dc.originais.uri	https://tede.ufrrj.br/jspui/handle/jspui/5245
dc.originais.provenance	Submitted by Jorge Silva (jorgelmsilva@ufrrj.br) on 2021-11-20T22:24:54Z No. of bitstreams: 1 2019 - Suzane Pereira Lima.pdf: 2995803 bytes, checksum: aeb78744da9f2f8ed596bb13477fde62 (MD5)	eng
dc.originais.provenance	Made available in DSpace on 2021-11-20T22:24:57Z (GMT). No. of bitstreams: 1 2019 - Suzane Pereira Lima.pdf: 2995803 bytes, checksum: aeb78744da9f2f8ed596bb13477fde62 (MD5) Previous issue date: 2019-07-30	eng
Appears in Collections:	Mestrado em Modelagem Matemática e Computacional

Se for cadastrado no RIMA, poderá receber informações por email.
Se ainda não tem uma conta, cadastre-se aqui!

Files in This Item:

File	Description	Size	Format
2019 - Suzane Pereira Lima.pdf		2.93 MB	Adobe PDF	View/Open

Show simple item record