Unregistered words in everyday language and a proposal for their optimal lexicographic microstructure


  • Yinxia Huang Pai Chai University
  • Kilim Nam Kyungpook National University




Keywords: unregistered words, instant messenger corpus, microstructure, lexicography, neologisms


This article examines how lexicography can adapt to media change. Instant messengers are the most popular communication medium in Korea: according to the latest survey by Gallup Korea, they are used by 92% of the population overall. This means that, from a corpus linguistic point of view, an instant messenger corpus provides an ideal resource for accessing the language of the masses. In this contribution, we analyze an instant messenger corpus of 1.4 million words and examine the prevalent unregistered words in the corpus in order to propose a microstructural model for them. Section 2 introduces the normalized parallel Messenger corpus used in this study and sets out the operational definition of unregistered words for dictionary inclusion, together with the process used to extract them. Section 3 examines the prevalence of unregistered words in the Messenger corpus and categorizes them based on the characteristics of messenger language. These characteristics encompass deviations from the pre-existing writing system, deviations from linguistic norms, deviations from socio-ethical criteria, incidental omissions, and non-verbal expressions. Section 4 proposes an optimal lexicographic structure that incorporates unregistered words and the characteristics identified in the previous sections. Additionally, we discuss how the microstructures of existing dictionaries could be extended and modified to represent this new medium’s language effectively.

Author Biographies

  • Yinxia Huang, Pai Chai University

    Yinxia Huang is an assistant professor in linguistics and has been teaching since 2011 at the Department of Korean Language, Literature, and Education at Pai Chai University, South Korea. She received her doctoral degree in corpus linguistics from Yonsei University, Seoul. She is currently a board member of KOREALEX. Her major interests are corpus linguistics, contrastive linguistics, and lexicography.

  • Kilim Nam, Kyungpook National University

    Kilim Nam is a professor at the Department of Korean Language and Literature at Kyungpook National University (Daegu, South Korea). She holds a PhD in Korean linguistics (on the copula ida structures in contemporary Korean, 2004) from Yonsei University (Seoul). She is currently a board member of KOREALEX and president of ASIALEX. She has been the principal investigator of the Korean Neologism Investigation Project since 2012. Her research focuses on corpus linguistics and language performance.


Huang, Y., & Nam, K. (2023). Unregistered words in everyday language and a proposal for their optimal lexicographic microstructure. Lexicography, 10(2), 94-116. https://doi.org/10.1558/lexi.26357