TROM Social | Community

More recreational reading for the world's 17 lovers of wrangling both finite automata and UTF-8. I previously challenged Fedi with the question: How many states does an automaton need to match any Unicode code point? Answer in the next post and explanation in the blog piece.

Matching “.” in UTF-8: tbray.org/ongoing/When/202x/20…

in reply to Tim Bray

Tim Bray

in reply to Tim Bray • 1 year ago • •

Concealed answer

Sensitive content

in reply to Tim Bray

Daphne Preston-Kendal

in reply to Tim Bray • 1 year ago • •

I can’t comment on your blog for some reason, so I’ll answer here:

Normalization and normal forms has nothing to do with overlong UTF-8 sequences. Normalization has to do with valid code unit sequences which represent different codepoint sequences which in turn represent different ways of writing a character which humans generally consider ‘the same’.

The example Ed Davies gave is invalid UTF-8: a conforming decoder should reject it, for reasons touched on in section 10 of RFC 3629.

in reply to Daphne Preston-Kendal

Tim Bray

in reply to Daphne Preston-Kendal • 1 year ago • •

@dpk Having 5 states doesn't imply dealing with anything out of the usual 1-to-4 byte UTF-8 encoding range. Ther are multiple paths through the automaton.

@Daphne Preston-Kendal

in reply to Tim Bray

Daphne Preston-Kendal

in reply to Tim Bray • 1 year ago • •

Please take another look at Ed Davies’s comment. The problem is codepoints being represented in more bytes than the minimum for the encoding of that codepoint. The letter ‘a’ MUST only be representable by the single byte x61. The two bytes xC1 xA1 will decode to the same codepoint in a naïve decder, but MUST be rejected by a conforming decoder. This happens *within* the 2 to 4 byte long range of 20.whatever-bit UTF-8.

in reply to Daphne Preston-Kendal

Tim Bray

in reply to Daphne Preston-Kendal • 1 year ago • •

@dpk Oh, you're entirely right. My tiny automaton needs changing to omit C0 and C1 mappings from the Start State. Will fix and credit. Thanks!

@Daphne Preston-Kendal

in reply to Daphne Preston-Kendal

Tim Bray

in reply to Daphne Preston-Kendal • 1 year ago • •

@dpk Cool! Looks like a tasty source of test cases.

@Daphne Preston-Kendal

MostlyHarmless

1 year ago • •

MostlyHarmless
1 year ago • •

in reply to MostlyHarmless

Talya (she/her) 🏳️‍⚧️✡️

in reply to MostlyHarmless • 1 year ago • •

repost with alt text, Elon Musk's father

Sensitive content

#alt4u

The Collector™

1 year ago • •

The Collector™
1 year ago • •

Putin and Fico Hold One-on-One Meeting in Moscow to Discuss Key Issues sputnikglobe.com/20241222/puti… MOSCOW (Sputnik) - Russian President Vladimir Putin has received Slovak Prime Minister Robert Fico at the Kremlin, who is in Moscow on a working visit, Kremlin spokesman Dmitry Peskov said on Sunday. #news #press

#News #press

PossumEveryHour

1 year ago • •

PossumEveryHour
1 year ago • •

Adam Hunt

1 year ago • •

Adam Hunt
1 year ago • •

Seen in #Ottawa: "no gloves allowed".

#Ottawa

like this

in reply to Adam Hunt

clarice overhere

in reply to Adam Hunt • 1 year ago • •

or no waving, no hi-5s and dont touch the poles (but licking them is fine)

like this

in reply to Adam Hunt

Adam Hunt

in reply to Adam Hunt • 1 year ago • •

LOL. The signs are very ambiguous!

Here is what I do know. These three plastic posts are part of a row of about 20 posts at a light rail station in Ottawa. The trains we have, Alstom Citadis Spirit trains, are normally run with two trains coupled together. They stop at the station so the join is behind this row of posts, since you can't board there, since obviously humans are too stupid to figure out that you should enter one car or the other by the doors and not just stand on the coupling mechanism. The posts are flexibly mounted at their bases, because otherwise I am sure that people would lean on them or try to sit on them or something, even though they are at the very edge of the platform. That is a rail you can see behind them. The problem is, because they are flexibly mounted, if you do lean on them they will flex and you will probably just fall onto the tracks or similar. I note that even a baboon would not do that, only an intensely stupid human would. The signs are not clear but I assume they just mean "do not touch", although they could be interpreted mean "no glov

Richard likes this.

An🎃🎃sh

1 year ago • •

An🎃🎃sh
1 year ago • •

𝕕𝕚𝕒𝕟𝕖𝕒 🏳️‍⚧️🦋

1 year ago • •

𝕕𝕚𝕒𝕟𝕖𝕒 🏳️‍⚧️🦋
1 year ago • •

like this

in reply to 𝕕𝕚𝕒𝕟𝕖𝕒 🏳️‍⚧️🦋

Bellarome

in reply to 𝕕𝕚𝕒𝕟𝕖𝕒 🏳️‍⚧️🦋 • 1 year ago • •

Interesting, how the wealthy came about their wealth. Would that be emerald mines in South Africa ??

Eugene McParland 🇺🇦

1 year ago • •

Eugene McParland 🇺🇦
1 year ago • •

#BMW has sacked employees from a branch in Hanover for exporting its cars to russia.

An investigation found more than 100 premium cars had been illegally given to russians despite #sanctions.

evrimagaci.org/tpg/bmw-halts-v…

#BMW #sanctions

This entry was edited (1 year ago)

in reply to Eugene McParland 🇺🇦

João Costa 💚🇵🇹🇪🇺🇬🇧🇺🇦

in reply to Eugene McParland 🇺🇦 • 1 year ago • •

Well done 👍👍👍 🇩🇪🤝🇺🇦

Kplx

1 year ago • •

Kplx
1 year ago • •

Besinnt euch alle mal schnell!
kplx.art/hls

Eine A 4 Seite erzählt hektisch eine Weihnachtsgeschichte von Charles Dickens

The Collector™

1 year ago • •

The Collector™
1 year ago • •

Interview der Woche Weihnachten, Liebe & Familie: „Zum Fest der Tante eine reinwürgen!“ jungefreiheit.de/debatte/inter… Zum Weihnachtsfest wird in den Medien wieder vor „Schwurbel-Opa“ oder dem „AfD-Onkel“ gewarnt. Doch was, wenn die Grünen-Tante zeternd den Familienfrieden stört? Der Psychiater Christian Spaemann – Sohn des bekannten Philosophen Robert Speamann – erklärt im JF-Interview, wie Sie die Feiertage dennoch

JA Westenberg

1 year ago • •

JA Westenberg
1 year ago • •

It’s not just TikTok or Meta though. It’s us.

We tell ourselves we’re above it, that we want “real content,” but our clicking and sharing habits tell a different story. The platforms might be dealing the cards, but we’re the ones choosing to play hand after hand, shoveling this shit down our own throats and asking for seconds.

medium.com/westenberg/want-to-…

Jacob Urlich 🌍 likes this.

reshared this

in reply to JA Westenberg

Jacob Urlich 🌍

in reply to JA Westenberg • 1 year ago •

A very simmilar story in 1000 pages trom book- the origin of the most problems. The same ironic story about music industry - you can find at tromsite books - which are trade-free for everyone.

Glyn Moody

1 year ago • •

Glyn Moody
1 year ago • •

Addressing Crowds, President #Zurabishvili Summons #Ivanishvili to Negotiate New Elections - civil.ge/archives/647521 well, that's bold, but not sure he will come... #georgia

#Georgia #ivanishvili #zurabishvili

Open Culture (Official)

1 year ago • •

Open Culture (Official)
1 year ago • •

The Story Behind the Making of the Iconic Surrealist Photograph, Dalí Atomicus (1948)

openculture.com/2024/12/the-st…

Bryce Wray

1 year ago • •

Bryce Wray
1 year ago • •

Users of MailMate, an outstanding Mac-only email client, will be interested in licensing changes announced in this blog post by MailMate’s developer:

blog.freron.com/2024/new-licen…

#email #license #subscription

Glyn Moody

1 year ago • •

Glyn Moody
1 year ago • •

Energy Prices Drop Below Zero In UK Thanks To Record Wind-Generated Electricity - hardware.slashdot.org/story/24… clever old wind #renewables

#renewables

in reply to Glyn Moody

Simon Greenwood

in reply to Glyn Moody • 1 year ago • •

There is some grim irony of course, as increased high wind is in part due to climate change.

in reply to Simon Greenwood

Glyn Moody

in reply to Simon Greenwood • 1 year ago • •

@simon it is; at least we get some benefit from this worsening situation...

@Simon Greenwood

memeorandum

1 year ago • •

memeorandum
1 year ago • •

Trump Previews Second Term in Sprawling Speech to Conservative Conference (Michael D. Shear/New York Times)

nytimes.com/2024/12/22/us/poli…
memeorandum.com/241222/p25#a24…

The Collector™

1 year ago • •

The Collector™
1 year ago • •

Gedanken des Balkonisten – Über scheinbaren "Wahlkampf" und humorlose Hypersensibilität de.rt.com/meinung/230350-gedan… Was stimmt nur mit diesem Wahlkampf nicht, der so gar nicht wie ein Wahlkampf wirken will? Und warum sind diese "Eliten" so empfindlich? Unser Balkonist hat sich seine Gedanken gemacht – und ist dabei bei Goethe gelandet. Das letzte Wort hat dieses Mal der Kater. #news #press

#News #press

Transgender World

1 year ago • •

Transgender World
1 year ago • •

UK: Transgender woman wins court case for transfer to female prison

bbc.com/news/articles/c4g2w3q1…

#transgender #trans #LGBTQ #LGBTQIA

#LGBTQ #transgender #trans #lgbtqia

Electronic Frontier Foundation

1 year ago • •

Electronic Frontier Foundation
1 year ago • •

Calls for the UK’s Prime Minister and Foreign Secretary to secure the release of activist Alaa Abd El-Fattah are building from many directions. They must do more to ensure Alaa’s immediate and unconditional release. eff.org/deeplinks/2024/12/uk-p…

Misty

1 year ago • •

Misty
1 year ago • •

Rodem the Wild (iTAChoco Systems, Mac, 1994)

Scerenshot of a sidescrolling video game with a vey strange-looking hand-drawn dog.

⇧

Tim Bray 1 year ago • •

MostlyHarmless 1 year ago • •

The Collector™ 1 year ago • •

PossumEveryHour 1 year ago • •

Adam Hunt 1 year ago • •

An🎃🎃sh 1 year ago • •

𝕕𝕚𝕒𝕟𝕖𝕒 🏳️‍⚧️🦋 1 year ago • •

Eugene McParland 🇺🇦 1 year ago • •

Kplx 1 year ago • •

The Collector™ 1 year ago • •

JA Westenberg 1 year ago • •

Glyn Moody 1 year ago • •

Open Culture (Official) 1 year ago • •

Bryce Wray 1 year ago • •

Glyn Moody 1 year ago • •

memeorandum 1 year ago • •

The Collector™ 1 year ago • •

Transgender World 1 year ago • •

Electronic Frontier Foundation 1 year ago • •

Misty 1 year ago • •

Tim Bray
1 year ago • •

MostlyHarmless
1 year ago • •

The Collector™
1 year ago • •

PossumEveryHour
1 year ago • •

Adam Hunt
1 year ago • •

An🎃🎃sh
1 year ago • •

𝕕𝕚𝕒𝕟𝕖𝕒 🏳️‍⚧️🦋
1 year ago • •

Eugene McParland 🇺🇦
1 year ago • •

Kplx
1 year ago • •

The Collector™
1 year ago • •

JA Westenberg
1 year ago • •

Glyn Moody
1 year ago • •

Open Culture (Official)
1 year ago • •

Bryce Wray
1 year ago • •

Glyn Moody
1 year ago • •

memeorandum
1 year ago • •

The Collector™
1 year ago • •

Transgender World
1 year ago • •

Electronic Frontier Foundation
1 year ago • •

Misty
1 year ago • •