More recreational reading for the world's 17 lovers of wrangling both finite automata and UTF-8. I previously challenged Fedi with the question: How many states does an automaton need to match any Unicode code point? Answer in the next post and explanation in the blog piece.
Matching “.” in UTF-8: tbray.org/ongoing/When/202x/20…
like this
It’s not just TikTok or Meta though. It’s us.
We tell ourselves we’re above it, that we want “real content,” but our clicking and sharing habits tell a different story. The platforms might be dealing the cards, but we’re the ones choosing to play hand after hand, shoveling this shit down our own throats and asking for seconds.
Jacob Urlich 🌍 likes this.
reshared this
The Story Behind the Making of the Iconic Surrealist Photograph, Dalí Atomicus (1948)
Users of MailMate, an outstanding Mac-only email client, will be interested in licensing changes announced in this blog post by MailMate’s developer:
Trump Previews Second Term in Sprawling Speech to Conservative Conference (Michael D. Shear/New York Times)
nytimes.com/2024/12/22/us/poli…
memeorandum.com/241222/p25#a24…
UK: Transgender woman wins court case for transfer to female prison
bbc.com/news/articles/c4g2w3q1…
#transgender #trans #LGBTQ #LGBTQIA
Tim Bray
in reply to Tim Bray • • •Sensitive content
Daphne Preston-Kendal
in reply to Tim Bray • • •I can’t comment on your blog for some reason, so I’ll answer here:
Normalization and normal forms has nothing to do with overlong UTF-8 sequences. Normalization has to do with valid code unit sequences which represent different codepoint sequences which in turn represent different ways of writing a character which humans generally consider ‘the same’.
The example Ed Davies gave is invalid UTF-8: a conforming decoder should reject it, for reasons touched on in section 10 of RFC 3629.
Tim Bray
in reply to Daphne Preston-Kendal • • •Daphne Preston-Kendal
in reply to Tim Bray • • •Tim Bray
in reply to Daphne Preston-Kendal • • •Tim Bray
in reply to Daphne Preston-Kendal • • •