Parsing Words and Building Meanings: A Natural Language Processing Study

Anand Rao Sanjay Kumar

Authors

Anand Rao Sanjay Kumar USA Author

Keywords:

POS Tagging, Natural Language, Building Meanings, Parsing words, Syntactic Parsing words

Abstract

This research paper explored natural language processing, word parsing, and meaning in terms of their implications for high-speed computing. Furthermore, it aimed to understand the connection between NLP and possibilities of language understanding in computers and how this knowledge can be applied to contemporary computer technology. The identified key barriers and solutions to NLP problems concerned the human inability to find the sequence and context notion that would frame even similarly disturbing behavior. The knowledge was enhanced by applying the theoretical framework of the distributional hypothesis and part-of-speech tagging in parsing methods for syntactic analysis pro and knowledge of hierarchical structure to maximize it. At the same time, it also addressed the method of increasing the number of performance of named entities per sentence paring them into several parts & ambiguity in the part of the speech to which a token belonged, and complexity in the previous named entity segmentation. The available data were obtained from newspapers & publishers: the editorials of newspapers and magazines, English imaginable sentences from Urdu newspapers like Edawn, and other public newspapers in Pakistan. A corpus of editorials was built through pre-processing published materials including tokenization, which involved distinguishing words and phrases, preparing a network of words substructure in semantic parsing pairs, and removing special tokens or brackets. The results show successful parsing & little sentential ambiguity & syntactic tree’s complexity. The high-profile result established human-computer communication possibilities at the next level and optimizations in NLP functionality for translation, information retrieval, and virtual assistant’s techniques, influencing notable theoretical contributions.

References

Akhil, K. K., Rajimol, R., & Anoop, V. S. (2020). Parts-of-Speech tagging for Mala-yalam using deep learning techniques. International Journal of Information Tech-nology, 12(3), 741-748.

Anita, R., & Subalalitha, C. N. (2019, December). Building discourse parser for Thirukkural. In Proceedings of the 16th International Conference on Natural Lan-guage Processing (pp. 18-25).

Bakhshi, M., Nematbakhsh, M., Mohsenzadeh, M., & Rahmani, A. M. (2022). SParseQA: Sequential word reordering and parsing for answering complex natural language questions over knowledge graphs. Knowledge-Based Systems, 235, 107626.

Foland, W., & Martin, J. H. (2017, July). Abstract meaning representation parsing using lstm recurrent neural networks. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp. 463-472).

He, H., & Choi, J. (2020, May). Establishing strong baselines for the new decade: Sequence tagging, syntactic and semantic parsing with BERT. In The Thirty-Third International Flairs Conference.

Khurana, D., Koli, A., Khatter, K., & Singh, S. (2023). Natural language processing: State of the art, current trends and challenges. Multimedia tools and applications, 82(3), 3713-3744.

Kwiatkowski, T., Choi, E., Artzi, Y., & Zettlemoyer, L. (2013, October). Scaling se-mantic parsers with on-the-fly ontology matching. In Proceedings of the 2013 con-ference on empirical methods in natural language processing (pp. 1545-1556).

Lee, K., Artzi, Y., Dodge, J., & Zettlemoyer, L. (2014, June). Context-dependent se-mantic parsing for time expressions. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp. 1437-1447).

Pattnaik, S., & Nayak, A. K. (2022). A Modified Markov-Based Maximum-Entropy Model for POS Tagging of Odia Text. International Journal of Decision Support System Technology (IJDSST), 14(1), 1-24

Song, J., Kim, J., & Lee, J. K. (2018). NLP and deep learning-based analysis of building regulations to support automated rule checking system. In ISARC. Pro-ceedings of the International Symposium on Automation and Robotics in Construc-tion (Vol. 35, pp. 1-7). IAARC Publications.

Wang, C., Ross, C., Kuo, Y. L., Katz, B., & Barbu, A. (2021, October). Learning a natural-language to LTL executable semantic parser for grounded robotics. In Con-ference on Robot Learning (pp. 1706-1718). PMLR.

Warjri, S., Pakray, P., Lyngdoh, S. A., & Maji, A. K. (2021). Part-of-speech (POS) tagging using conditional random field (CRF) model for Khasi corpora. International Journal of Speech Technology, 24(4), 853-864

Wang, Y., Berant, J., & Liang, P. (2015, July). Building a semantic parser overnight. In Proceedings of the 53rd Annual Meeting of the Association for Computational Lin-guistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers) (pp. 1332-1342).

Nivedhaa N, " From Raw Data to Actionable Insights: A HolisticSurvey of Data Science Processes," International Journal of Data Science (IJDS), vol.1, issue 1, pp. 1-16, 2024