DokuWiki

It's better when it's simple

User Tools

Site Tools


utf-8

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
utf-8 [2009-02-04 13:06] 87.110.177.92utf-8 [2017-11-13 02:49] (current) 114.84.118.11
Line 1: Line 1:
 ====== UTF-8 Encoding ====== ====== UTF-8 Encoding ======
  
-[[DokuWiki]] now stores all its data in UTF-8. To avoid problems, the filenames of the datafiles itself are [[phpfn>urlencode|URL-encoded]] when saved. Older versions of DokuWiki used different encodings so the datafiles need to be [[tips:utf8update|reencoded]] when the software is updated. Switching the used encoding to charsets different from UTF-8 is **not** supported.+[[DokuWiki]] now stores all its data in UTF-8. To avoid problems, the filenames of the datafiles itself are [[phpfn>urlencode|URL-encoded]] when saved. DokuWiki versions older than release 2005-02-06 used different encodings so the datafiles need to be [[tips:utf8update|reencoded]] when the software is updated. Switching the used encoding to charsets different from UTF-8 is **not** supported.
  
 ===== Browser Setup for UTF-8 ===== ===== Browser Setup for UTF-8 =====
Line 7: Line 7:
 All modern browsers do handle UTF-8 encoded web pages - it's one of the few things that actually work as expected in most browsers. If your browser doesn't display some characters correctly, you are probably missing the correct Unicode fonts. All modern browsers do handle UTF-8 encoded web pages - it's one of the few things that actually work as expected in most browsers. If your browser doesn't display some characters correctly, you are probably missing the correct Unicode fonts.
  
-Windows users should install the ''Arialuni.TTF'' font from Microsoft. It is included in Microsoft's Office Suite or can be downloaded from the web (e.g. [[coral>http://www1.uni-hamburg.de/exmaralda/Deutsch/2-Download/pe-partitur-editor.html|here]]).+Windows users should install the ''Arialuni.TTF'' font from Microsoft. It is included in Microsoft's Office Suite.
  
 [[http://debian.org|Debian]] users can read my [[notes>debianfonts|page on fonts]] to learn how to install Unicode fonts correctly. [[http://debian.org|Debian]] users can read my [[notes>debianfonts|page on fonts]] to learn how to install Unicode fonts correctly.
Line 31: Line 31:
  
   * On Unix use http://www.gnu.org/software/libiconv/   * On Unix use http://www.gnu.org/software/libiconv/
- +  * On Window use recode, a port of iconv: http://recode.progiciels-bpi.ca/archives 
-  * On Window use recode, a port of iconv : http://recode.progiciels-bpi.ca/archives +    * Example of a simple conversion for french local computer:<code>recode lat1..u8 test.txt</code>with ''lat1'' the source charset and ''u8'' the conversion charset for UTF-8. 
-    * Example of a simple conversion for french local computer : +    * To batch the conversion on Windows use this (conversion of all the files in sub-directory)<code>FOR /F "tokens=*" %%G IN ('dir/b/S/X ^"C:\yourpath\*.txt^"') DO recode -v lat1..u8 %%~sG</code> 
-<code> recode lat1..u8 test.txt </code> +  More explanation there: [[http://jeanmarcmassou.free.fr/dokuwiki/doku.php?id=blogw:batch_on_win_for_utf_conversion|the link]]
-with lat1 the source charset and u8 the conversion charset for UTF-8. +
- +
-    * To batch the conversion on Windows use this (conversion of all the files in sub-directory) +
- +
-<code> +
- +
-FOR /F "tokens=*" %%G IN ('dir/b/S/X ^"C:\yourpath\*.txt^"') DO recode -v lat1..u8 %%~sG +
- +
-</code> +
- +
-More explanation there : [[http://jeanmarcmassou.free.fr/dokuwiki/doku.php?id=blogw:batch_on_win_for_utf_conversion|the link]]+
  
  
Line 78: Line 67:
  
 Αρχαίο Πνεύμα Αθάνατον!   Αρχαίο Πνεύμα Αθάνατον!  
-Ἰοὺ ἰού· τὰ πντʼ ἂν ἐξκοι σαφῆ. +Ἰοὺ ἰού· τὰ πάντʼ ἂν ἐξήκοι σαφῆ. 
-  Ὦ φῶς, τελευταῖν σε προσϐλψαιμι νῦν, +  Ὦ φῶς, τελευταῖόν σε προσϐλέψαιμι νῦν, 
-  ὅστις πφασμαι φς τʼ ἀφʼ ὧν οὐ χρῆν, ξὺν οἷς τʼ +  ὅστις πέφασμαι φύς τʼ ἀφʼ ὧν οὐ χρῆν, ξὺν οἷς τʼ 
-  οὐ χρῆν ὁμιλῶν, οὕς τέ μʼ οὐκ ἔδει κτανν.+  οὐ χρῆν ὁμιλῶν, οὕς τέ μʼ οὐκ ἔδει κτανών.
  
 Modern Greek: Modern Greek:
Line 105: Line 94:
   곱고 아름답게 사용하는 것이 우리의 의무입니다.   곱고 아름답게 사용하는 것이 우리의 의무입니다.
  
-Chinese:+Traditional Chinese:
  
   子曰:「學而時習之,不亦說乎?有朋自遠方來,不亦樂乎?   子曰:「學而時習之,不亦說乎?有朋自遠方來,不亦樂乎?
Line 113: Line 102:
   不好犯上,而好作亂者,未之有也。君子務本,本立而道生。   不好犯上,而好作亂者,未之有也。君子務本,本立而道生。
   孝弟也者,其為仁之本與!」   孝弟也者,其為仁之本與!」
 +  
 + Simplified Chinese:
 +
 +  子曰:「学而时习之,不亦说乎?有朋自远方来,不亦乐乎?
 +  人不知而不慍,不亦君子乎?」
 +  
 +  有子曰:「其为人也孝弟,而好犯上者,鲜矣;
 +  不好犯上,而好作乱者,未之有也。君子务本,本立而道生。
 +  孝弟也者,其为仁之本与!」
  
 Japanese: Japanese:
Line 129: Line 127:
   Glāžšķūņa rūķīši jautri dziedādami čiepj koncertflīģeļa vāku.    Glāžšķūņa rūķīši jautri dziedādami čiepj koncertflīģeļa vāku. 
  
-Simplified Chinese:+Simplified Chinese :
  
-  这是简体字汉语。 zhè shì jiǎn t zì hàn yǔ +  这是简体字汉语zhè shì jiǎn tǐ zì hàn yǔ 
  
 Armenian: Armenian:
Line 138: Line 136:
   Ամենալավ օրենքները չեն օգնի, եթե մարդիկ բանի պետք չեն:   Ամենալավ օրենքները չեն օգնի, եթե մարդիկ բանի պետք չեն:
  
 +Persian:
 +
 +  بنی‌آدم اعضای یک‌دیگرند / که در آفرینش ز یک گوهرند
 +  
 Hebrew: Hebrew:
  
   המשפט עם הזכוכית שאפשר לאכול בלי שזה מפריע, לא זוכר איך הוא הולך   המשפט עם הזכוכית שאפשר לאכול בלי שזה מפריע, לא זוכר איך הוא הולך
- 
utf-8.txt · Last modified: 2017-11-13 02:49 by 114.84.118.11

Except where otherwise noted, content on this wiki is licensed under the following license: CC Attribution-Share Alike 4.0 International
CC Attribution-Share Alike 4.0 International Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki