The ROI of AI in lexicography

Authors

  • Erin McKean Wordnik
  • Will Fitzgerald Wordnik

DOI:

https://doi.org/10.1558/lexi.27569

Keywords:

LLMs, dictionary, lexicography, evaluation, text generation

Abstract

Large Language Models (LLMs) are being used for many language-based tasks, including translation, summarization and paraphrasing, sentiment analysis, and for content-generation tasks, such as code generation, answering search queries in natural language, and to power chatbots in customer service and other domains. Since much modern lexicography is based on investigation and analysis of large-scale corpora analogous to the (much larger) corpora used to train LLMs, we hypothesize that LLMs could be used for typical lexicographic tasks. A commercially-available LLM API (OpenAI’s ChatGPT gpt-3.5-turbo) was used to complete typical lexicographic tasks, such as headword expansion, phrase and form finding, and creation of definitions and examples. The results showed that the output of this LLM is not up to the standard of human editorial work, requiring significant oversight because of errors and “hallucinations” (the tendency of LLMs to invent facts). In addition, the externalities of LLM use, including concerns about environmental impact and replication of bias, add to the overall cost.

Author Biographies

  • Erin McKean, Wordnik

    Erin McKean is the founder and CEO of Wordnik. She was previously the Editor in Chief of American Dictionaries for Oxford University Press and the Editorial Manager for the Thorndike-Barnhart Dictionaries, and has served on the Board of Visitors for the Dictionary of American Regional English. A list of her books and links to her talks on lexicographic and other topics are available at erinmckean.com

  • Will Fitzgerald, Wordnik

    Will Fitzgerald is a staff machine learning engineer at GitHub and an advisor at large for the Wordnik Society.

References

Atkins, B. T. S., and Rundell, M. (2008). The Oxford guide to practical lexicography. Oxford: Oxford University Press.

Basque, K. (2023). Fine-tuning an LLM into a style guide editor. https://technicalwriting.tools/posts/style-guide-fine-tuning (last accessed May 6, 2023).

Béjoint, H. (2000) Modern lexicography: An introduction. Oxford: Oxford University Press.

Bender, E. M., Gebru, T., McMillan-Major, A., and Shmitchell, S. (2021). On the dangers of stochastic parrots: Can language models be too big? . Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, 610–623. https://doi.org/10.1145/3442188.3445922

Biderman, S., Prashanth, U. S. V. S. N., Sutawika, L., Schoelkopf, H., Anthony, Q., Purohit, S., and Raf, E. (2023). Emergent and predictable memorization in large language models. https://arxiv.org/abs/2304.11158 (last accessed May 26, 2023).

Firth, J. R. (1957). A synopsis of linguistic theory, 1930–1955. In J. R. Firth (Ed.), Studies in linguistic analysis. Oxford: Basil Blackwell.

Franceschi-Bicchierai, L. (2023). Jailbreak tricks Discord’s new chatbot into sharing napalm and meth instructions. TechCrunch https://techcrunch.com/2023/04/20/jailbreak-tricks-discords-new-chatbot-into-sharing-napalm-and-meth-instructions/ (last accessed May 28, 2023).

GPT-J-6B. (2023). https://6b.eleuther.ai/ (last accessed May 23, 2023).

Haan, K. (2023). Over 75% of consumers are concerned about misinformation from artificial intelligence. Forbes Advisor https://www.forbes.com/advisor/business/artificial-intelligence-consumer-sentiment/ (last accessed May 26, 2023).

Hanks, P. (2008). Obituary for Laurence Urdang. International Journal of Lexicography, 21(4), 467–477.

Jackson, H. (2013). Lexicography: An introduction. Abingdon: Taylor and Francis.

Kirk. J. (2023). “Don Knuth plays with ChatGPT” but with ChatGPT-4. https://gist.github.com/Jessime/63f93215faed6f7109c6d62b7fef7fbc#file-knuth_questions-md (last accessed May 25, 2023).

Knuth, D. (2023). Untitled note. https://cs.stanford.edu/~knuth/chatGPT20.txt (last accessed May 26, 2023).

Korn, J. (2023). Vanderbilt University apologizes for using ChatGPT to write mass-shooting email. https://www.cnn.com/2023/02/22/tech/vanderbilt-chatgpt-shooting-email/index.html (last accessed May 26, 2023).

Landau, S. (2001). Dictionaries: The art and craft of lexicography (2nd ed.). Cambridge: Cambridge University Press.

Li, P., Yang, J., Islam, M., and Ren, S. (2023). Making AI less “thirsty”: Uncovering and addressing the secret water footprint of AI models. https://arxiv.org/pdf/2304.03271.pdf (last accessed May 26, 2023).

LinkedIn (2023a). https://www.linkedin.com/jobs/search/?currentJobId=3612512663&geoId=103644278&keywords=%22lexicography%22&location=United%20States&refresh=true (last accessed May 27, 2023).

LinkedIn (2023b). https://www.linkedin.com/jobs/search/?currentJobId=3406473842&geoId=103644278&keywords=%22machine%20learning%22%20%22data%20scientist%22&location=United%20States&refresh=true (last accessed May 27, 2023).

Luccioni, A., Viguier, S., and Ligozat, A. L. (2022). Estimating the carbon footprint of BLOOM, a 176B parameter language model. https://doi.org/10.48550/arXiv.2211.02001 (last accessed May 26, 2023).

Merriam-Webster. https://www.merriam-webster.com/ (last accessed May 24, 2023).

Nvidia (2024). Nvidia GPU — Design Life-Cycle. https://www.designlife-cycle.com/nvidia-gpu (last accessed May 2, 2024).

OpenAI (2023). https://github.com/openai/evals (last accessed May 28, 2023).

OpenAI Community Forum (2023). https://community.openai.com/t/cheat-sheet-mastering-temperature-and-top-p-in-chatgpt-api-a-few-tips-and-tricks-oncontrolling-the-creativity-deterministic-output-of-prompt-responses/172683 (last accessed May 26, 2023).

Orth, T. (2023) What Americans think about ChatGPT and AI-generated text https://today.yougov.com/topics/technology/articles-reports/2023/02/01/what-americans-think-about-chatgpt-and-ai-text (last accessed May 26, 2023).

Oxford Languages. https://languages.oup.com/why-has-oxford-dictionaries-changed/ (last accessed May 26, 2023).

Ralphson, M. (2018). A brief history of the OpenAPI Specification. https://dev.to/mikeralphson/a-brief-history-of-the-openapi-specification-3g27 (last accessed May 24, 2023).

SketchEngine (2023). https://www.sketchengine.eu. (last accessed May 24, 2023).

Svensén, B. (1993). Practical lexicography: Principles and methods of dictionary-making. Oxford: Oxford University Press.

T0. (2022). https://bigscience.huggingface.co/blog/t0. (last accessed May 26, 2023).

U.S. Copyright Office. (2023). Copyright registration guidance: Works containing material generated by artificial intelligence, Federal Register. https://www.federalregister.gov/documents/2023/03/16/2023-05321/copyright-registration-guidance-works-containing-material-generated-by-artificial-intelligence (last accessed May 27, 2023).

Vincent, J. (2023). OpenAI says it could “cease operating” in the EU if it can’t comply with future regulation. The Verge. https://www.theverge.com/2023/5/25/23737116/openai-ai-regulation-eu-ai-act-cease-operating (last accessed May 26, 2023).

Weiner, E. (2019). Digitizing the OED: The making of the second edition. https://public.oed.com/blog/digitizing-the-oed-the-making-of-the-second-edition/ (last accessed May 24, 2023).

Wiktionary (2023). Wiktionary contributors “cheesehead.” https://en.wiktionary.org/w/index.php?title=cheesehead&oldid=72967160 (last accessed May 27 2023).

Wiles, J. (2022). What’s new in artificial intelligence from the 2022 Gartner Hype Cycle. https://www.gartner.com/en/articles/what-s-new-in-artificial-intelligence-from-the-2022-gartner-hype-cycle (last accessed May 27, 2023).

WIRED. (2023). How WIRED will use generative AI tools. https://www.wired.com/about/generative-ai-policy/ (last accessed May 27, 2023).

Published

2024-07-11

How to Cite

McKean, E., & Fitzgerald, W. (2024). The ROI of AI in lexicography. Lexicography, 11(1), 7-27. https://doi.org/10.1558/lexi.27569