Talk:Large language model
| This is the talk page for discussing improvements to the Large language model article. This is not a forum for general discussion of the subject of the article. |
Article policies
|
| Find sources: Google (books · news · scholar · free images · WP refs) · FENS · JSTOR · TWL |
| Archives: 1Auto-archiving period: 3 months |
| This It is of interest to multiple WikiProjects. | |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Lead length and complexity
editHi 0xReflektor. I saw that you fixed the template "lead too short" in the article large language model; the lead was indeed too short (around half the size of a typical lead) but has now become overly long (around twice the size of a typical lead). We should probably condense it, or move some of its content to the rest of the article.
I also believe the lead is too difficult to understand for a Wikipedia audience (WP:TECHNICAL). It would be well-written for a research article, but many people come to Wikipedia to discover what the concept means, and so we should try to write the lead so that they understand most of it even if they lack many of the underlying concepts and jargon. Alenoach (talk) 14:27, 4 October 2025 (UTC)
- I add this message just to indicate that I removed on October 11 one lead paragraph and integrated another into the body. The lead's length is more normal now. There is still the issue with the technical complexity though. Alenoach (talk) 18:42, 20 October 2025 (UTC)
- I attempted to reduce the lead's technical complexity by simplifying language and moving some more technical sections to the body. I also removed terminology that the source papers did not use, such as "few-shot learning" and "hill climbing". Diff for posterity and small subsequent copy edit. SenshiSun (talk) 21:18, 26 March 2026 (UTC)
Large language models as marketing channels
editI've been attempting to clean up and improve the articles related to using LLMs as a marketing channel. Specifically Generative engine optimization, AI SEO, Answer engine optimization, and Search engine optimization. I then noticed that this article has no mention of this. I am not suggesting that promotional language be added in here. I think it could even live within the "societal concerns" section. Curious what other editors think. Dflovett (talk) 12:28, 13 February 2026 (UTC)
- May work better at Chatbot. Unless you are talking about training poisoning then I don't think this is DUE for LLMs as a general concept. Of course, it depends on the sources. Czarking0 (talk) 15:49, 13 February 2026 (UTC)
- I think there's a thin line between training poisoning and marketing but yeah, I'll start there. Makes sense. Dflovett (talk) 15:03, 16 February 2026 (UTC)
Chat hack
editShould this be mentioned somewhere ? Yesterday, all my dreams... (talk) 22:36, 20 February 2026 (UTC)
This is unclear
editMoving beyond n-gram models, researchers started in 2000 to use neural networks to "learn" language models
Does this really mean "learn about" or perhaps "teach".
Its important: the reader cannot easily understand what's actually meant here. ~2026-24004-11 (talk) 13:35, 10 May 2026 (UTC)
- You're right that there was an issue with this sentence, thanks for reporting. I replaced it with "to use neural networks as language models." Alenoach (talk) 15:09, 10 May 2026 (UTC)
