• UMIR Communities
    • UM Main
    • UM Bansalan
    • UM Digos
    • UM Guianga
    • UM Ilang-Tibungco
    • UM Panabo
    • UM Peñaplata
    • UM Tagum
  • Library Catalog
    • UM Main OPAC
    • UM Bansalan OPAC
    • UM Digos OPAC
    • UM Guianga OPAC
    • UM Ilang-Tibungco OPAC
    • UM Panabo OPAC
    • UM Peñapalata OPAC
    • UM Tagum OPAC
  • Login
 
View Item 
  •   UMIR Home
  • UM Tagum
  • Undergraduate Theses
  • View Item
  •   UMIR Home
  • UM Tagum
  • Undergraduate Theses
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.
Citation Tool

     
N/A

Text-based language prediction of three native languages of Davao del Norte using Multinomial Naive Bayes algorithm

View/Open
Manuscript Language Material
Date
2022-12
Author
Esperanza, Dianne Faith
Rulida, Kezze Krisna C.
Keywords
Algorithms -- Computer programming
Language and languages
Citation Tool
Metadata
Show full item record
Abstract
There are few technologically based native or minority language studies in Davao del Norte, which is home to several indigenous languages. Some of these minority languages are represented either insufficiently or not on the internet or any other electronic resource. As the internet becomes more pervasive in society on a global scale, digitization is becoming increasingly crucial for language preservation. Using Multinomial Naive Bayes, we developed a language identification tool to identify Davao del Norte native languages (Ata-Manobo, Cebuano, and Mansaka) presented in a text. We experimented with varying sizes and quality datasets until the desired level of accuracy and performance had been achieved. The classifier's accuracy did not change significantly after adding training data. However, when tested with new inputs, the classifier appeared to perform better than it did with a smaller dataset. The model attained an accuracy of 98.43% due to the incorporation of additional training data. The input (text) length is still an essential factor to consider for an accurate language prediction.
URI
https://repository.umindanao.edu.ph/handle/20.500.14045/2064
Collections
  • Undergraduate Theses [181]
Publisher
Department of Arts and Sciences Education- Bachelor of Science in Computer Science

 

 

Browse

All of UMIRCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

LoginRegister