Module talk:Lang/data
Latest comment: 1 month ago by OwenBlacker in topic Edit request 3 May 2026
| Module:Lang/data is indefinitely protected from editing as it is a heavily used or highly visible module. Substantial changes should first be proposed and discussed here on this page. If the proposal is uncontroversial or has been discussed and is supported by consensus, editors may use {{edit template-protected}} to notify an administrator or template editor to make the requested edit.
|
| This is the talk page for discussing improvements to the Lang/data module. |
|
| Archives: 1Auto-archiving period: 3 months |
| This module does not require a rating on Wikipedia's content assessment scale. It is of interest to the following WikiProjects: | ||||||||
| ||||||||
Transliteration systems for Tibetan
editIs there a reason why this module does not appear to include any systems for transliterating Tibetan, not even the major ones like THDL, ZWPY, or Wylie? ALA-LC is supported, but that seems to only be the case because there are ALA-LC schemes for other languages. – Scyrme (talk) 02:15, 6 March 2026 (UTC)
- @Trappist the monk: (Notifying you, as you previously answered edit requests here. If you're not interested or have no idea, feel free to ignore this, but I'd appreciate if you could let me know if there's someone who might have an answer.) – Scyrme (talk) 02:19, 6 March 2026 (UTC)
- Probably because no one has demonstrated a need for these systems. Can you demonstrate a need?
- —Trappist the monk (talk) 14:03, 6 March 2026 (UTC)
- There are multiple competing systems used on Wikipedia. Right now the only one supported is one of the least used ones, namely ALA-LC. Most articles for Tibetan topics use either Wylie and THDL, but ZWPY is the official standard of the PRC so it's not negligible. The use of differing systems is widespread and could benefit from the ability to tag. While it's often easy to tell which is Wylie by its extensive use of unusual consonant clusters for Tibetan etymological conjuncts and its preservation of final consonants, this isn't always so, and THDL and ZWPY especially have a lot of overlap. In some cases completely different words are transliterated to the same Latin text in different systems, so when transliterated text is provided without specifying a system or the original Tibetan script text it can be difficult to find the original word. For example, the initials c, cw, gc, bc, lc, py, lpy, spy, dpy in Wylie can all correspond to j in ZWPY, while THDL would use cha, which seems unambiguous, except Wylie uses j for ZWPY's q which THDL also renders as j, but THDL also uses j in cases where it would also be j in ZWPY. So if all you have is a transliteration and it's not tagged and it has a j in one of its syllables, it could be a wide range of very different words. Even just ja on its own could be བྱ, 'bird', or ཇ, 'tea', or ཆ, 'part'. Hopefully from this you can see how it can be easy to confuse which is which, especially if you're a reader who lacks context. Many Tibet articles are cluttered with repeated explicit script labels in order to avoid these confusions. See, for example, Kagyu which repeats "Wylie" 62 times. Not all articles are that bad, but it's enough of a problem to warrant adding support for {{tlit}} to tag the system. It already supports tagging for languages where transliteration/romanization is much less complicated.. – Scyrme (talk) 15:36, 6 March 2026 (UTC)
{{tlit|bo|thdl|text}}→<span title="THL Simplified Phonetic Transcription"><i lang="bo-Latn">text</i></span>→ text{{tlit|bo|wylie|text}}→<span title="Wylie transliteration"><i lang="bo-Latn">text</i></span>→ text{{tlit|bo|zwpy|text}}→<span title="Tibetan pinyin"><i lang="bo-Latn">text</i></span>→ text- —Trappist the monk (talk) 16:10, 6 March 2026 (UTC)
- Thanks! – Scyrme (talk) 17:42, 6 March 2026 (UTC)
- There are multiple competing systems used on Wikipedia. Right now the only one supported is one of the least used ones, namely ALA-LC. Most articles for Tibetan topics use either Wylie and THDL, but ZWPY is the official standard of the PRC so it's not negligible. The use of differing systems is widespread and could benefit from the ability to tag. While it's often easy to tell which is Wylie by its extensive use of unusual consonant clusters for Tibetan etymological conjuncts and its preservation of final consonants, this isn't always so, and THDL and ZWPY especially have a lot of overlap. In some cases completely different words are transliterated to the same Latin text in different systems, so when transliterated text is provided without specifying a system or the original Tibetan script text it can be difficult to find the original word. For example, the initials c, cw, gc, bc, lc, py, lpy, spy, dpy in Wylie can all correspond to j in ZWPY, while THDL would use cha, which seems unambiguous, except Wylie uses j for ZWPY's q which THDL also renders as j, but THDL also uses j in cases where it would also be j in ZWPY. So if all you have is a transliteration and it's not tagged and it has a j in one of its syllables, it could be a wide range of very different words. Even just ja on its own could be བྱ, 'bird', or ཇ, 'tea', or ཆ, 'part'. Hopefully from this you can see how it can be easy to confuse which is which, especially if you're a reader who lacks context. Many Tibet articles are cluttered with repeated explicit script labels in order to avoid these confusions. See, for example, Kagyu which repeats "Wylie" 62 times. Not all articles are that bad, but it's enough of a problem to warrant adding support for {{tlit}} to tag the system. It already supports tagging for languages where transliteration/romanization is much less complicated.. – Scyrme (talk) 15:36, 6 March 2026 (UTC)
Edit request 3 May 2026
editBefore line 2000, please add
["crb"] = "Kalinago", -- Island Carib; to match en.wiki article
per request at Category talk:Articles containing Island Carib-language text. I've made the appropriate edit to Module:Lang/data/sandbox, but I don't remember how to test a module include like this. — OwenBlacker (he/him; Talk) 21:29, 3 May 2026 (UTC)
{{lang/sandbox|crb|fn=name_from_tag|link=yes}}→ Kalinago{{lang/sandbox|crb|fn=category_from_tag|link=yes}}→ Category:Articles containing Kalinago-language text{{lang|crb|fn=name_from_tag|link=yes}}→ Kalinago{{lang|crb|fn=category_from_tag|link=yes}}→ Category:Articles containing Kalinago-language text- —Trappist the monk (talk) 21:41, 3 May 2026 (UTC)
- Aha, thank you.
- That looks good then — the purpose was to rename Category:Articles containing Island Carib-language text → Category:Articles containing Kalinago-language text. Thank you! — OwenBlacker (he/him; Talk) 22:34, 3 May 2026 (UTC)