Cost-Sensitive Algorithms for Text Classification in the Legal Domain::  Addressing Imbalanced Lawsuit Themes

daniela

doi:10.5540/tcam.2025.026.e01859

Cost-Sensitive Algorithms for Text Classification in the Legal Domain:

Addressing Imbalanced Lawsuit Themes

Autores

daniela USP https://orcid.org/0000-0002-5363-3608

DOI:

https://doi.org/10.5540/tcam.2025.026.e01859

Palavras-chave:

mbalanced classification, cost-sensitive learning, machine learning, resampling, text classification

Resumo

This article discusses the challenges of imbalanced classification in machine learning, where algorithms often incorrectly assume an even distribution of instances between classes. This issue is common in real-world scenarios, leading to poor representation of minority classes in training data. To combat this, Cost-Sensitive Learning techniques have been developed, focussing on minimising the overall cost of misclassification rather than merely optimising accuracy. These techniques are categorised into three types: Cost-Sensitive Resampling, Algorithms, and Hybrid techniques. The research presents a case study on classifying lawsuits into repetitive themes in São Paulo Court, Brazil, using these cost-sensitive approaches on an imbalanced dataset. The goal is to automate the classification of lawsuits to save time, use human resources more effectively, and speed up the resolution of the lawsuit. The study highlights the effectiveness of cost-sensitive techniques in handling imbalanced classification and their benefits in real-world applications, particularly in the legal field, by improving efficiency and reducing manual workload and processing time for lawsuits.

Downloads

pdf (English)

Publicado

2025-11-28

Como Citar

daniela. (2025). Cost-Sensitive Algorithms for Text Classification in the Legal Domain:: Addressing Imbalanced Lawsuit Themes. Trends in Computational and Applied Mathematics, 26(1), e01859. https://doi.org/10.5540/tcam.2025.026.e01859

Baixar Citação

Edição

v. 26 n. 1 (2025)

Seção

Artigo Original

Licença

Este trabalho está licenciado sob uma licença Creative Commons Attribution-NoDerivatives 4.0 International License.

Direitos Autorais

Autores de artigos publicados no periódico Trends in Computational and Applied Mathematics mantêm os direitos autorais de seus trabalhos. O periódico utiliza a Atribuição Creative Commons (CC-BY) nos artigos publicados. Os autores concedem ao periódico o direito de primeira publicação.

Propriedade Intelectual e Termos de uso

O conteúdo dos artigos é de responsabilidade exclusiva dos autores. O periódico utiliza a Atribuição Creative Commons (CC-BY) nos artigos publicados. Esta licença permite que os artigos publicados sejam reutilizados sem permissão para qualquer finalidade, desde que o trabalho original seja corretamente citado.

O periódico encoraja os Autores a autoarquivar seus manuscritos aceitos, publicando-os em blogs pessoais, repositórios institucionais e mídias sociais acadêmicas, bem como postando-os em suas mídias sociais pessoais, desde que seja incluída a citação completa à versão do website da revista.

Cost-Sensitive Algorithms for Text Classification in the Legal Domain:

Addressing Imbalanced Lawsuit Themes

Autores

DOI:

Palavras-chave:

Resumo

Downloads

Publicado

Como Citar

Edição

Seção

Licença

Enviar Submissão

ISSN

ISSN 2676-0029

Palavras-chave

Indexadores

Informações

Desenvolvido por