wmd, to random
@wmd@chaos.social avatar

How can unicode ever be considered complete without a :guillotine: emoji...

typographica, to languagelearning
@typographica@typo.social avatar

SILICON Digitally Disadvantaged Languages Fellowship Program: Call for Proposals.

SILICON invites applications from keyboard designers, app designers, type designers, Large Language Model designers, language ethnographers, OCR/ML experts, and digital typographic experts for inaugural SILICON Fellowship Program. Awards of up to $7,000 to be granted.

https://docs.google.com/forms/d/e/1FAIpQLSdx4mUMwhHNu1Lgxr0bKlcYcTNY3WdQWQQyVKgCzXe9SAPYLA/viewform

elilla, to random
@elilla@transmom.love avatar

everybody's hyped for the new 1FAE9 FACE WITH BAGS UNDER EYES, which I mean, :big_mood:, but for you appreciators of text-presentation glyphs, might I draw your attention to the new range "Symbols for Legacy Computing Supplement" (1CC00–1CEBF) that includes several gaming sprites from retrocomputer codesets including Pac-Man, a full set of Space Invaders, 1CC96 FLAPPING BIRD and so on—as well as a full set of box characters for emulators?

https://www.unicode.org/charts/PDF/Unicode-16.0/U160-1CC00.pdf

Another section showing some old terminal drawing elements ("white lower left pointer", "two rings aligned horizontally", "inverse black diamond" etc.) and more game sprites (such as tanks and racing cars and fish in various positions).

liaizon,
@liaizon@wake.st avatar

@elilla this is exciting...

exegete, to hebrew
@exegete@autonomous.zone avatar

I just stumbled onto something horrifying, neo-Nazi symbolism seemingly hidden away in . The first Unicode codepoint, corresponding to א, is u05D0. The integer corresponding to the hex? 1488. You can't convince me that was a mere coincidence.

Who planned this???

CharlotteBuff,
@CharlotteBuff@mastodon.social avatar

@krans @zyd @exegete Are we really doing “NYC is ☠️✡️👍️ in Wingdings” again after 32 years?

liaizon,
@liaizon@wake.st avatar
spacemagick, to Futurology
@spacemagick@mastodon.social avatar
drq,
@drq@mastodon.ml avatar

@spacemagick Fucking... Exactly.

Why? WHYYYYYY? Do we really need this shit in Unicode? Or did we fucking run out of things to do as far as writing systems are concerned? Somebody over at Unicode Consortium has too much free time, g-sus.

Edent, to webdev
@Edent@mastodon.social avatar

🆕 blog! “Accents and eBooks”

By and large, the English language doesn't use diacritical marks. Even our loanwords are stripped of them; we drink in a cafe rather than the more pretentious café. This has a consequence for HTML and, by extension, eBooks. As a quick primer, modern computing gives us two main ways of displaying a letter with an […]

👀 Read more: https://shkspr.mobi/blog/2024/05/accents-and-ebooks/

CTD,
@CTD@mastodon.social avatar

@Edent The second dimension to this is to find a font for your ebook that supports these natively. My reader does show the correct glyph but if it isn’t present in the actual font it uses some kind of default and that is jarring.

blog, to webdev
@blog@shkspr.mobi avatar

Accents and eBooks
https://shkspr.mobi/blog/2024/05/accents-and-ebooks/

By and large, the English language doesn't use diacritical marks. Even our loanwords are stripped of them; we drink in a cafe rather than the more pretentious café. This has a consequence for HTML and, by extension, eBooks.

As a quick primer, modern computing gives us two main ways of displaying a letter with an accent. The first is simple - encode every single accented letter as a separate "pre-composed" character. So è (U+00E8), é (U+00E0), ê (U+00EA, and ë (U+00EB) are all stored as different codepoints.

But this seems a little inefficient and can make it hard to search through text for an exact lexical match.

So there is a second way to add accents. You take the base character - e (U+0065) - and then apply a separate "combining" accent character to it. For example the combining accent ◌́ (U+0301). That means you can add an accent to áńý ĺét́t́éŕ!́

Note, the accent ◌́ (U+0301) is separate from the character ´ (U+00B4). In fact, most accents have a pre-composed, combining, and separate form. This, understandably, causes much confusion!

Here's a good example. I was reading the excellent Fallen Idols, when I noticed this typesetting bug.

The phrase "Swords of Qadisiyyah." But the combining macron over the letter "a" has been rendered as a separate dash.

It's always hard to transliterate languages. The Victory Arch in Iraq is known as قوس النصر, and usually written in English as the "Swords of Qādisīyah".

Examining the HTML code in the eBook, it was obvious that the publishers had used a macron ¯ (U+00AF) rather than the combining version ◌̄ (U+0304).

I've reported it to the publisher. I've no idea if they'll fix it in a subsequent re-issue.

https://shkspr.mobi/blog/2024/05/accents-and-ebooks/

Edent, to random
@Edent@mastodon.social avatar

The nice thing about finding a typographical mistake in an is that they files are just HTML.
You can inspect element and see where the bug is.

In this case, they used &; when they clearly meant &;

pip,

@Edent my 14 year old has been doing this with translations of Korean web novels and submitting fixes back upstream - from his phone. 🛠

Wuzzy, to Game
@Wuzzy@cyberplace.social avatar

#Repixture 3.15.1 is here!

Sign text will now finally be shown on the sign. Over 60,000 glyphs are supported. A new pole sign can stand on the ground, hang from the ceiling or a wall.

There's also a spyglass item, and moon phases have been added.

▶️ Release notes: https://forum.minetest.net/viewtopic.php?p=435725#p435725
▶️ ContentDB page: https://content.minetest.net/packages/Wuzzy/repixture/

#Game #Minetest #release #sandbox #voxel #Unicode

lukiss, to random

( //
q=[
[[" ","│"," "],["╮","╰","┬"],["╰","┬","╯"]],
[["╭","╯","◦"],["┴","┬","─"],[" ","│"," "]],
[["o","│"," "],["─","╯","╭"],[" ","╭","╯"]],
[[" ","╰","╮"],["╮"," ","╰"],["╰","╮"," "]],
[["╭","┴","╮"],["┤","°","╰"],["╰","╮","◯"]],
[["╭","┴","╮"],["┤","O","├"],["╰","┬","╯"]],
];
8.do{
t=q.pyramid(3).scramble;
t[0].size.do{|l|
t.size.do{|n|t[0].size.do{|i|t[n][l][i].post}};
Post.nl
}
});

mcc,
@mcc@mastodon.social avatar

@lukiss Gorgeous

ramsey, to random
@ramsey@phpc.social avatar

I found what I think is an edge-case bug in ICU. It’s unlikely to impact most folks, unless you’re trying to run the ECMA-402 test suite.

https://unicode-org.atlassian.net/browse/ICU-22765

enablelanguages,

@ramsey I assume und@x=private means an undetermined language with a private subtag called "private"?

Interestingly PyICU using icu.Locale.addLikelySubtags(icu.Locale('und-x-private')) gives me
<Locale: en_Latn_US@x=private>

ramsey,
@ramsey@phpc.social avatar

@enablelanguages Right. That’s what it’s supposed to return. What version of ICU is PyICU linked against?

SnoopJ, to mahjong
@SnoopJ@hachyderm.io avatar

my recent interest in has collided with my on-going interest in as I remember that the block U+1F000 through U+1F02B are allocated for encoding tiles

🀀🀁🀂🀃🀄🀅🀆🀇🀈🀉🀊🀋🀌🀍🀎🀏🀐🀑🀒🀓🀔🀕🀖🀗🀘🀙🀚🀛🀜🀝🀞🀟🀠🀡🀢🀣🀤🀥🀦🀧🀨🀩🀪🀫

this information has no practical use to me, but it's nice that the UCS represents them

SnoopJ,
@SnoopJ@hachyderm.io avatar

I wrote a pretty dumb little program to deal a hand and I'm disappointed that I can't make it look consistent in the terminal with this font :(

onelson,
@onelson@mastodon.social avatar

@SnoopJ I thought these tiles were meant to have sexy ladies doing sexy things on them.

Wuzzy, to gamedev
@Wuzzy@cyberplace.social avatar

This is my sign test wall where I've been testing Unicode rendering on the signs in , using various strings.

🟩 green = OK
🟨 yellow = Unsupported, but renders as U+FFFD REPLACEMENT CHARACTER (also OK)
🟥 red = FAIL

I think I should maybe call it a day.

Wuzzy, to gamedev
@Wuzzy@cyberplace.social avatar

How should render (left) vs how it would render on signs (right) if I would re-enable it.

RTL support works, but that's not good enough. If I understand correctly, the glyphs also need to connect.

But then, even (the font I use) doesn't seem to have the neccessary glyph variants.

Wuzzy, to gamedev
@Wuzzy@cyberplace.social avatar

Turns out rendering is hard. No, you can't just implement the Algorithm and call it a day. It turns out the Arabic letters/symbols/? have different forms depending on where they are in the word and probably there are other non-trivial features. Yeah, I guess I just postpone this.

So my 1000 IQ workaround for now is to just render all arabic characters as U+FFFD REPLACEMENT CHARACTER for now.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • JUstTest
  • mdbf
  • ngwrru68w68
  • InstantRegret
  • magazineikmin
  • thenastyranch
  • rosin
  • khanakhh
  • tacticalgear
  • Youngstown
  • slotface
  • Durango
  • kavyap
  • DreamBathrooms
  • provamag3
  • ethstaker
  • GTA5RPClips
  • modclub
  • tester
  • Leos
  • osvaldo12
  • cisconetworking
  • everett
  • cubers
  • normalnudes
  • anitta
  • megavids
  • lostlight
  • All magazines