Talk:Large language model

This is the talk page for discussing improvements to the Large language model article.
This is not a forum for general discussion of the subject of the article.

Add new text under old text.
New to Wikipedia? Welcome! Learn to edit; get help.

Archives: 1: 3 months

Technology

This article is within the scope of WikiProject Technology, a collaborative effort to improve the coverage of technology on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.

Linguistics: Applied Linguistics Mid‑importance

	This article is within the scope of WikiProject Linguistics, a collaborative effort to improve the coverage of linguistics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
Mid	This article has been rated as Mid-importance on the project's importance scale.
	This article is supported by Applied Linguistics Task Force.

Robotics Mid‑importance

	This article is within the scope of WikiProject Robotics, a collaborative effort to improve the coverage of Robotics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
Mid	This article has been rated as Mid-importance on the project's importance scale.

Computing Top‑importance

	This article is within the scope of WikiProject Computing, a collaborative effort to improve the coverage of computers, computing, and information technology on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
Top	This article has been rated as Top-importance on the project's importance scale.

Artificial Intelligence

	This article is within the scope of WikiProject Artificial Intelligence, a collaborative effort to improve the coverage of Artificial intelligence on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
	This article has not yet received a rating on the project's importance scale.

Lead length and complexity

Latest comment: 2 months ago · 3 comments · 2 people in discussion

Hi 0xReflektor. I saw that you fixed the template "lead too short" in the article large language model; the lead was indeed too short (around half the size of a typical lead) but has now become overly long (around twice the size of a typical lead). We should probably condense it, or move some of its content to the rest of the article.

I also believe the lead is too difficult to understand for a Wikipedia audience (WP:TECHNICAL). It would be well-written for a research article, but many people come to Wikipedia to discover what the concept means, and so we should try to write the lead so that they understand most of it even if they lack many of the underlying concepts and jargon. Alenoach (talk) 14:27, 4 October 2025 (UTC)

I'm adding this message just to note that on October 11 I removed one lead paragraph and integrated another into the body. The lead's length is more normal now. There is still the issue with the technical complexity, though. Alenoach (talk) 18:42, 20 October 2025 (UTC)

I attempted to reduce the lead's technical complexity by simplifying language and moving some more technical sections to the body. I also removed terminology that the source papers did not use, such as "few-shot learning" and "hill climbing". Diff for posterity and small subsequent copy edit. SenshiSun (talk) 21:18, 26 March 2026 (UTC)

Large language models as marketing channels

Latest comment: 3 months ago · 3 comments · 2 people in discussion

I've been attempting to clean up and improve the articles related to using LLMs as a marketing channel, specifically Generative engine optimization, AI SEO, Answer engine optimization, and Search engine optimization. I then noticed that this article has no mention of this. I am not suggesting that promotional language be added here. I think it could even live within the "societal concerns" section. Curious what other editors think. Dflovett (talk) 12:28, 13 February 2026 (UTC)

This may work better at Chatbot. Unless you are talking about training poisoning, I don't think this is DUE for LLMs as a general concept. Of course, it depends on the sources. Czarking0 (talk) 15:49, 13 February 2026 (UTC)

I think there's a thin line between training poisoning and marketing, but yeah, I'll start there. Makes sense. Dflovett (talk) 15:03, 16 February 2026 (UTC)

Chat hack

Latest comment: 3 months ago · 1 comment · 1 person in discussion

Should this be mentioned somewhere? Yesterday, all my dreams... (talk) 22:36, 20 February 2026 (UTC)

This is unclear

Latest comment: 26 days ago · 2 comments · 2 people in discussion

Moving beyond n-gram models, researchers started in 2000 to use neural networks to "learn" language models

Does this really mean "learn about", or perhaps "teach"?

It's important: the reader cannot easily understand what's actually meant here. ~2026-24004-11 (talk) 13:35, 10 May 2026 (UTC)

You're right that there was an issue with this sentence, thanks for reporting. I replaced it with "to use neural networks as language models." Alenoach (talk) 15:09, 10 May 2026 (UTC)
