てはいさ said in forum #440873:
I'll be honest: does anyone care if the image is just dialogue or just thoughts? In my opinion, the only significant difference is between "dialogue" in the broadest sense, text on objects like packaging held by characters, author's comments, and utility text (e.g., on gacha banners).
Our base text tags are used whenever a given language is represented with the exception of Japanese with its supermajority over the site. Translation request doesn't have anything narrowing down what specifically folks want translated, with that being effectively decided by the translator (meaning that, for instance, they can opt out of translating common sound effectseven if we have a List of SFX for translation reference, or unimportant text like clothes writing or billboards if they wanted to, because they assume the thing people care most about for translation is just the core text, whether it be dialogue, thoughts, narration, etc.).
Having something to distinguish the various kinds of text in an image allows people to not just search for specific kinds of comics, whether for reference or similar, but allows translators to pick and choose contexts which they want to translate. Until we do something the fact that we don't have narrower searches for comics specifically, topic #30094, reducing searching options isn't particularly helpful for anyone.
In that regard, evazion's argument for a speech tag has some weight. But the reason why I emphasize the point of 'not being able to recognize whether something is actually speech or not' is that this can eventually result in speech basically becoming a text umbrella tag, since one can assume that most text in posts will be dialogue, thoughts, narration, etc. and indirectly through that also be a Japanese text tag, since doing speech -*text would give you Japanese text.