Tech Note: Diacritic Characters

At PZ’s suggestion, I’ve twiddled some knobs behind the scenes to force the blog to speak utf-8 instead of iso-8859-1. This will hopefully allow you guys to write even stranger comments than usual. Maybe I’ll even be able to stop writing stuff like “& a u m l ;” Please try it out! SÃ¥y sömëthïng ïn Swëdïsh! Mattias, have you any lewd suggestions to make in Koiné Greek? Is anyone able to rattle off a few lines of Arabic love poetry? Go nuts, y’all!

Advertisements

22 thoughts on “Tech Note: Diacritic Characters

  1. “Tröjan” var skoj!

    日本語で説明しょう。「トレジャン」ってスヴェーデン語で「あのセーター」という意味だ。

    Like

  2. Let’s see…

    ◄┐
    │
    │
    └►

    Hah! It strips out the non-breaking spaces! (at least in preview mode).

    Like

  3. ◄┐
     â”‚
     â”‚
     â””â–º

    Hah! It strips out the non-breaking spaces! (at least in preview mode).

    (replacing with ampersand nbsp semicolon + not going into preview)

    Like

  4. So U+00A0 gets stripped out but nbsp stays. I can live with that. Now, if only I could enforce monotype… Maybe a code tag will work?

    ◄┐
     â”‚
     â”‚
     â””â–º

    Like

  5. Didn’t work.

    Men tusind tak — tack sÃ¥ mycket — for at du gad gøre arbejdet, nÃ¥r nu seed ikke gad 🙂

    I øvrigt, siden du som jeg selv både læser WulffMorgenthaler og Skalk (min farmor har en komplet samling helt fra starten og ubrudt hele vejen ind til for et par år siden og jeg har læst dem alle!) så vil du måske synes om det her:

    En 15-Ã¥rig knægt fra Jylland som ikke selv ved hvor god han er — og som hørte sangen (Kiss from a Rose) for første gang fem dage før showet.

    Jeg kan også anbefale Lisas udgave af Crazy og Heidis udgave af Verden er i farver fra samme show:


    Like

  6. ᚻᛖ ᚳᚹᚫᚦ ᚦᚫᛏ ᚻᛖ ᛒᚢᛞᛖ ᚩᚾ ᚦᚫᛗ ᛚᚪᚾᛞᛖ ᚾᚩᚱᚦᚹᛖᚪᚱᛞᚢᛗ ᚹᛁᚦ ᚦᚪ ᚹᛖᛥᚫ

    (Old English, which transcribed into Latin reads ‘He cwaeth that he bude thaem lande northweardum with tha Westsae.’ and means ‘He said that he lived in the northern land near the Western Sea.’)

    Taken from one of my test files, which I believe originates from the Linux UTF-8 FAQ.

    Like

  7. τοτε διηνοιξεν αυτων τον νουν του συνιεναι τας γραφας

    / Mattias

    Like

  8. Seems the diacritics situation here has improved considerably but is still not ideal. The Scandy characters work well, which takes care of most of the problems.

    Like

  9. The evermost silly play with characters, even more so than that of certain 80:s metal groups, must be the american ice-cream company Häagen-Dazs. Apparently the company took its name in order to convey ‘european quality’ to the transatlantic market. A canadian gentleman once told me that the only word he knew in Danish was ‘Häagen-Dazs’, insisting that it meant ‘quality dairy product’! The ice-cream is superb, however, and popular also in Europe these days.

    / Mattias

    Like

  10. Hopefully this might make the above comment make sense to non-swedish speakers as well.

    Skor – Plural of shoe
    Sk̦r РFragile/brittle

    Like

  11. Maybe the person who put an umlaut on “Skor” had some knowledge of german and believed that all germanic languages used these funny letters to produce pluralis (thus making a false connection between, for example, “Mann”-“Männer” and “Sko”-“Skör”)?

    / Mattias

    Like

  12. I think we should avoid the temptation to anthropomorphise the Canadians, attributing a level of rationality to them of which there really is very little evidence.

    Like

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s