Topic: malay_text to meta

Posted under Tag Alias and Implication Suggestions

czyszy said:
The bulk update request #11348 is pending approval.

change category malay_text (1) -> meta

Reason: please change the category of the tag to make it uniform with the rest of language tags that are assigned to "Meta"

The proper meta tag already exists as either malaysian_text or indonesian_text.

I don't think we should keep a separate malay_text tag as the term "Malay" is vague and could refer to the language spoke in the region as a whole rather than the distinct "Malaysian" or "Indonesian" varieties.

thegreatwolfgang said:
The proper meta tag already exists as either malaysian_text or indonesian_text.

I don't think we should keep a separate malay_text tag as the term "Malay" is vague and could refer to the language spoke in the region as a whole rather than the distinct "Malaysian" or "Indonesian" varieties.

But that's exactly my point. That post features "malay text" (general category), and more precisely "indonesian text" (specific dialect). Though I'm not an expert on Malay/Bahasa dialectology so I didn't add the Indonesian tag. Many times written Malay can be amiguous.

So maybe let's do it this way. We're gonna add "malay text" as an implication linked with both "indonesian text" and "malaysian text". From what I've gathered, though both dialects are distinct, they're mostly mutually intelligible.

And also, it would be cool to add serbocroatian_text as an implication for serbian_text and croatian_text, since it's a similar case.

Updated

czyszy said:
But that's exactly my point. That post features "malay text" (general category), and more precisely "indonesian text" (specific dialect). Though I'm not an expert on Malay/Bahasa dialectology so I didn't add the Indonesian tag. Many times written Malay can be amiguous.

So maybe let's do it this way. We're gonna add "malay text" as an implication linked with both "indonesian text" and "malaysian text". From what I've gathered, though both dialects are distinct, they're mostly mutually intelligible.

And also, it would be cool to add serbocroatian_text as an implication for serbian_text and croatian_text, since it's a similar case.

You could make the argument that we should lump mutually intelligible languages under their respective umbrella language tags (e.g., imply belarusian_text, ukrainian_text, and russian_text -> east_slavic_text).
However, I don't think there is any precedence for such categorisation. I also personally don't find that to be necessary, but actually excessive and unhelpful.

Since the language tags are under the meta category to begin with, we do not have to strictly follow TWYS.
You can easily find the exact language used by checking the source or the artist's profile, which in turn would make translating and searching for that specific language much easier.
Adding a vague umbrella language tag would not help improve the searchability of the implied subtags, but would instead permit users to "ambiguously" tag the language used rather than "distinctively".

In short, a hypothetical malay_text umbrella tag would be no different than searching for ~malaysian_text ~indonesian_text. There is simply no need for it.
If users still can't figure out what exact language is being used, then they can simply add unknown_language and ask for help from others.

thegreatwolfgang said:
You could make the argument that we should lump mutually intelligible languages under their respective umbrella language tags (e.g., imply belarusian_text, ukrainian_text, and russian_text -> east_slavic_text).
However, I don't think there is any precedence for such categorisation. I also personally don't find that to be necessary, but actually excessive and unhelpful.

Since the language tags are under the meta category to begin with, we do not have to strictly follow TWYS.
You can easily find the exact language used by checking the source or the artist's profile, which in turn would make translating and searching for that specific language much easier.
Adding a vague umbrella language tag would not help improve the searchability of the implied subtags, but would instead permit users to "ambiguously" tag the language used rather than "distinctively".

In short, a hypothetical malay_text umbrella tag would be no different than searching for ~malaysian_text ~indonesian_text. There is simply no need for it.
If users still can't figure out what exact language is being used, then they can simply add unknown_language and ask for help from others.

Fair enough. 🗿