elprupneerg

my favorite colors backwards

  • xe/xer, per/per, it/its, they/them

hi! my only other social media is tumblr so my gf says i'll fit in fine here. i'm in my 20s and live in the united states, if you want more info than that then you can read my posts cuz i'm not putting it here (put in some fucking effort to dox me lmao) <3

You must log in to comment.

in reply to @0xabad1dea's post:

Ah, the joys of crowdsourced translations.

I wonder if that portmanteau pops up enough in the internet corpus they fed into the model, or if enough people gamed some 'suggest a better translation' functionality like they used to do with Google Translate.

I don't think this is caused by crowdsourcing or anyone having ever actually said this ever, but by the recent switch to LLM translators that work on tokens that approximate morphemes. It's just kind of losing track mid-word and using the front half of one acceptable translation and the back half of another. This same translator renders 蛐蛐儿, an informal but normal Chinese word for cricket, as crift, crips, crike, criet... it always gets the "cri-" right and then just kind of slaps a random token on the end.

Interesting, I would have expected a LLM translator to still be weighted by the context window of the output language towards sequences of tokens that occurs more frequently in the target language. (The Chinese word examples looks a lot more like that with several of the possibilities being words or subsets of words.)

If they do lose the output language context when translating certain tokens, that seems like a serious regression towards the behavior of extremely early machine translation systems.

i am a native speaker and i'm with abadidea on this, yeah. it's the continuous form

i'd also say it can be translated as 'beeping', a more neutral sound, since it's also used for something like an alarm clock. also figuratively for 'whining'