The Role of Artificial Intelligence in Modern-Term Formation

Nigar Sadigova

Abstract

The rapid expansion of scientific knowledge and digital content in the 21st century has significantly increased the demand for accurate, efficient, and scalable terminology development processes. Term formationthe creation, extraction, validation, and standardisation of specialised vocabularyhas traditionally been a manual endeavour by linguists, terminologists, and subject matter experts. However, with the growing complexity and volume of information, traditional methods are no longer sufficient to keep pace. Recent advancements in Artificial Intelligence (AI), particularly in Natural Language Processing (NLP), machine learning, and knowledge representation, are transforming terminology science. AI-powered systems can process vast multilingual and domain-specific corpora to automatically identify candidate terms, evaluate their relevance using semantic and statistical criteria, and suggest standardised forms and multilingual equivalents. These technologies support the automation of term extraction, filtering, clustering of semantic variants, and alignment across languages and disciplines. This article explores the core applications of AI in term formation, including automatic term extraction, term validation and filtering, semantic clustering, and standardisation. It also addresses integrating AI tools in multilingual environments and constructing terminological resources compatible with ontologies and knowledge graphs. While AI introduces speed, scalability, and contextual awareness to terminology management, it also raises challenges related to accuracy, cultural and linguistic sensitivity, and algorithmic bias. The study concludes that a hybrid model – combining AI's computational capabilities with expert human judgment – offers the most promising path toward creating dynamic, inclusive, and globally coherent terminological systems.



Keywords


artificial intelligence (AI); term formation; terminology management; machine learning

Full Text:

PDF


References


1. IAEA. (2022). Nuclear Safety and Security Glossary. doi: 10.61092/iaea.rrxi-t56z

2. Sadigova, N. (2024). Terminology of Nuclear Energy Field based on the English Language. Path of Science, 10(9), 5007–5013. doi: 10.22178/pos.108-14

3. Sadigova, N., & Aliyeva, S. (2025). Methods of terms formation in nuclear medicine. Open Research Europe, 5, 46. doi: 10.12688/openreseurope.18941.1

4. Drouin, P. (2003). Term extraction using non-technical corpora as a point of leverage. Terminology, 9(1), 99–115. doi: 10.1075/term.9.1.06dro

5. Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of NAACL-HLT (pp. 4171–4186). Retrieved from https://arxiv.org/abs/1810.04805

6. Cambria, E., & White, B. (2014). Jumping NLP Curves: A Review of Natural Language Processing Research. IEEE Computational Intelligence Magazine, 9(2), 48–57. doi: 10.1109/mci.2014.2307227

7. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. (2017). Attention is all you need. In Advances in Neural Information Processing Systems, 30. Retrieved from https://arxiv.org/abs/1706.03762

8. Wu, Y., Schuster, M., Chen, Z., Le, Q. V., Norouzi, M., Macherey, W., ... & Dean, J. (2016). Google's Neural Machine Translation System: Bridging the human and machine translation gap. Retrieved from https://arxiv.org/abs/1609.08144

9. Navigli, R., & Velardi, P. (2004). Learning Domain Ontologies from Document Warehouses and Dedicated Web Sites. Computational Linguistics, 30(2), 151–179. doi: 10.1162/089120104323093276

10. Haciyeva, N., & Sadiqova, N. (2023). Terminology in nuclear energy in the Azerbaijani language based on English. Terminology Issues, 01, 18. doi: 10.59849/2663-8967.2023.1.18


Article Metrics

Metrics Loading ...

Metrics powered by PLOS ALM

Refbacks

  • There are currently no refbacks.




Copyright (c) 2025 Nigar Sadigova

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.