| dc.contributor.advisor | Benjamin M. Mahinay, Jr., MIT | |
| dc.contributor.author | Esperanza, Dianne Faith | |
| dc.contributor.author | Rulida, Kezze Krisna C. | |
| dc.date.accessioned | 2025-11-10T01:57:24Z | |
| dc.date.available | 2025-11-10T01:57:24Z | |
| dc.date.copyright | 2022 | |
| dc.date.issued | 2022-12 | |
| dc.identifier.uri | https://repository.umindanao.edu.ph/handle/20.500.14045/2064 | |
| dc.description | A thesis presented to the thesis committee of the Department of Arts and Sciences Education. In partial fulfillment of the requirements for the degree Bachelor of Science in Computer Science. | en_US |
| dc.description.abstract | There are few technologically based native or minority language studies in Davao del Norte, which is home to several indigenous languages. Some of these minority languages are represented either insufficiently or not on the internet or any other electronic resource. As the internet becomes more pervasive in society on a global scale, digitization is becoming increasingly crucial for language preservation. Using Multinomial Naive Bayes, we developed a language identification tool to identify Davao del Norte native languages (Ata-Manobo, Cebuano, and Mansaka) presented in a text. We experimented with varying sizes and quality datasets until the desired level of accuracy and performance had been achieved. The classifier's accuracy did not change significantly after adding training data. However, when tested with new inputs, the classifier appeared to perform better than it did with a smaller dataset. The model attained an accuracy of 98.43% due to the incorporation of additional training data. The input (text) length is still an essential factor to consider for an accurate language prediction. | en_US |
| dc.language.iso | en_US | en_US |
| dc.publisher | Department of Arts and Sciences Education- Bachelor of Science in Computer Science | en_US |
| dc.rights | UM Tagum College LIC | |
| dc.subject | Algorithms -- Computer programming | en_US |
| dc.subject | Language and languages | en_US |
| dc.title | Text-based language prediction of three native languages of Davao del Norte using Multinomial Naive Bayes algorithm | en_US |
| dc.type | Thesis | en_US |
| dc.contributor.panel | John Jefferson L. Dela Cruz, MIT | |
| dc.contributor.panel | Iris Mae C. Mendoza, MIT | |
| dc.description.xtnt | vii, 26 pages | |