PHP : Function Reference : XML Parser Functions : utf8_encode PHP Examples Tutorials References

aidan kehoe

Here's some code that addresses the issue that Steven describes in the previous comment; <?php /* This structure encodes the difference between ISO-8859-1 and Windows-1252, as a map from the UTF-8 encoding of some ISO-8859-1 control characters to the UTF-8 encoding of the non-control characters that Windows-1252 places at the equivalent code points. */ $cp1252_map = array( "\xc2\x80" => "\xe2\x82\xac", /* EURO SIGN */ "\xc2\x82" => "\xe2\x80\x9a", /* SINGLE LOW-9 QUOTATION MARK */ "\xc2\x83" => "\xc6\x92", /* LATIN SMALL LETTER F WITH HOOK */ "\xc2\x84" => "\xe2\x80\x9e", /* DOUBLE LOW-9 QUOTATION MARK */ "\xc2\x85" => "\xe2\x80\xa6", /* HORIZONTAL ELLIPSIS */ "\xc2\x86" => "\xe2\x80\xa0", /* DAGGER */ "\xc2\x87" => "\xe2\x80\xa1", /* DOUBLE DAGGER */ "\xc2\x88" => "\xcb\x86", /* MODIFIER LETTER CIRCUMFLEX ACCENT */ "\xc2\x89" => "\xe2\x80\xb0", /* PER MILLE SIGN */ "\xc2\x8a" => "\xc5\xa0", /* LATIN CAPITAL LETTER S WITH CARON */ "\xc2\x8b" => "\xe2\x80\xb9", /* SINGLE LEFT-POINTING ANGLE QUOTATION */ "\xc2\x8c" => "\xc5\x92", /* LATIN CAPITAL LIGATURE OE */ "\xc2\x8e" => "\xc5\xbd", /* LATIN CAPITAL LETTER Z WITH CARON */ "\xc2\x91" => "\xe2\x80\x98", /* LEFT SINGLE QUOTATION MARK */ "\xc2\x92" => "\xe2\x80\x99", /* RIGHT SINGLE QUOTATION MARK */ "\xc2\x93" => "\xe2\x80\x9c", /* LEFT DOUBLE QUOTATION MARK */ "\xc2\x94" => "\xe2\x80\x9d", /* RIGHT DOUBLE QUOTATION MARK */ "\xc2\x95" => "\xe2\x80\xa2", /* BULLET */ "\xc2\x96" => "\xe2\x80\x93", /* EN DASH */ "\xc2\x97" => "\xe2\x80\x94", /* EM DASH */ "\xc2\x98" => "\xcb\x9c", /* SMALL TILDE */ "\xc2\x99" => "\xe2\x84\xa2", /* TRADE MARK SIGN */ "\xc2\x9a" => "\xc5\xa1", /* LATIN SMALL LETTER S WITH CARON */ "\xc2\x9b" => "\xe2\x80\xba", /* SINGLE RIGHT-POINTING ANGLE QUOTATION*/ "\xc2\x9c" => "\xc5\x93", /* LATIN SMALL LIGATURE OE */ "\xc2\x9e" => "\xc5\xbe", /* LATIN SMALL LETTER Z WITH CARON */ "\xc2\x9f" => "\xc5\xb8" /* LATIN CAPITAL LETTER Y WITH DIAERESIS*/ ); function cp1252_to_utf8($str) { global $cp1252_map; return strtr(utf8_encode($str), $cp1252_map); } ?>

migueldiaz

Here's my function to know if one string is encoded in UTF8. If we encode in UTF8 a string or text file that is already encoded in UTF8, it's expected to find the character 'ƒ' ( ALT+159) in the final string. <?php function isUTF8($string) { $string_utf8 = utf8_encode($string); if( strpos($string_utf8,"ƒ",0) !== false ) // "ƒ" is ALT+159 return true; // the original string was utf8 else return false; // otherwise } ?> regards Miguel Díaz

http://iubito.free.fr

Here's a function I made to know if one string or textfile is already encoded in UTF8 : <?php /** * Returns <kbd>true</kbd> if the string or array of string is encoded in UTF8. * * Example of use. If you want to know if a file is saved in UTF8 format : * <code> $array = file('one file.txt'); * $isUTF8 = isUTF8($array); * if (!$isUTF8) --> we need to apply utf8_encode() to be in UTF8 * else --> we are in UTF8 :) * </code> * @param mixed A string, or an array from a file() function. * @return boolean */ function isUTF8($string) { if (is_array($string)) { $enc = implode('', $string); return @!((ord($enc[0]) != 239) && (ord($enc[1]) != 187) && (ord($enc[2]) != 191)); } else { return (utf8_encode(utf8_decode($string)) == $string); } } ?>

romans

Here is optimized function which converts binary UTF symbol code into unicoded string. function code2utf($num){ if($num<128)return chr($num); if($num<1024)return chr(($num>>6)+192).chr(($num&63)+128); if($num<32768)return chr(($num>>12)+224).chr((($num>>6)&63)+128).chr(($num&63)+128); if($num<2097152)return chr($num>>18+240).chr((($num>>12)&63)+128).chr(($num>>6)&63+128). chr($num&63+128); return ''; }

bmorel

Here is an improved version of that function, compatible with 31-bit encoding scheme of Unicode 3.x : <?php function seems_utf8($Str) { for ($i=0; $i<strlen($Str); $i++) { if (ord($Str[$i]) < 0x80) continue; # 0bbbbbbb elseif ((ord($Str[$i]) & 0xE0) == 0xC0) $n=1; # 110bbbbb elseif ((ord($Str[$i]) & 0xF0) == 0xE0) $n=2; # 1110bbbb elseif ((ord($Str[$i]) & 0xF8) == 0xF0) $n=3; # 11110bbb elseif ((ord($Str[$i]) & 0xFC) == 0xF8) $n=4; # 111110bb elseif ((ord($Str[$i]) & 0xFE) == 0xFC) $n=5; # 1111110b else return false; # Does not match any model for ($j=0; $j<$n; $j++) { # n bytes matching 10bbbbbb follow ? if ((++$i == strlen($Str)) || ((ord($Str[$i]) & 0xC0) != 0x80)) return false; } } return true; } ?>

mualem_i

Hebrew!! What a language. I had some trouble placing the Hebrew in a javascript based drop down menu, the text appeared as UTF8 so I made this function to overcome the problem (Not talking about efficiency) function rtf_heb($string) { $array = split (" ",$string) ; foreach ($array as $VAL) { $VAL = str_replace("&#1488","à",$VAL); $VAL = str_replace("&#1489","á",$VAL); $VAL = str_replace("&#1490","â",$VAL); $VAL = str_replace("&#1491","ã",$VAL); $VAL = str_replace("&#1492","ä",$VAL); $VAL = str_replace("&#1493","å",$VAL); $VAL = str_replace("&#1494","æ",$VAL); $VAL = str_replace("&#1495","ç",$VAL); $VAL = str_replace("&#1496","è",$VAL); $VAL = str_replace("&#1497","é",$VAL); $VAL = str_replace("&#1499","ë",$VAL); $VAL = str_replace("&#1500","ì",$VAL); $VAL = str_replace("&#1502","î",$VAL); $VAL = str_replace("&#1504","ð",$VAL); $VAL = str_replace("&#1505","ñ",$VAL); $VAL = str_replace("&#1506","ò",$VAL); $VAL = str_replace("&#1508","ô",$VAL); $VAL = str_replace("&#1510","ö",$VAL); $VAL = str_replace("&#1511","÷",$VAL); $VAL = str_replace("&#1512","ø",$VAL); $VAL = str_replace("&#1513","ù",$VAL); $VAL = str_replace("&#1514","ú",$VAL); $VAL = str_replace("&#1498","ê",$VAL); $VAL = str_replace("&#1507","ó",$VAL); $VAL = str_replace("&#1503","ï",$VAL); $VAL = str_replace("&#1501","í",$VAL); $VAL = str_replace("&#1509","õ",$VAL); $VAL = str_replace(";","",$VAL); $send_VAR .= $VAL." "; } return $send_VAR; }

lorro

Good news is that utf8_encode (like UTF-8) passes '<', '>', '/', '\'', '"', etc., so you are free to utf8_encode complete blocks of html text that includes tags. Bad news is that UTF-8 is stupid enough so that utf8_encode(utf8_encode($str)) != utf8_encode($str) in most of the cases. What you can do is write utf8_ensure like: function utf8_ensure($str) { return seems_utf8($str)? $str: utf8_encode($str); } Comes handy when your view library tries to encode the same text multiple times.

27-aug-2002 07:30

For XML generation, if you want non-ASCII ISO-8859-1 characters within text and attributes, you don't absolutely need UTF-8 encoding: The optional XML declaration can change the default encoding for characters from UTF-8 to ISO-8859-1: <?xml version="1.0" encoding="iso-8859-1" ?> This can save a lot of PHP code if you just want to generate ISO-8859-1 text and attribute values... XML specification requires that all parsers support both the UTF-8 encoding (by default), and the ISO-8859-1 character set. Other character sets may be supported also by specifying them in the encoding attribute of the leading XML declaration (but the target parser must support this character set to allow automatic conversion of the source text into Unicode character entities.

bisqwit

For reference, it may be insightful to point out that: utf8_encode($s) is actually identical to: recode_string('latin1..utf8', $s) and: iconv('iso-8859-1', 'utf-8', $s) That is, utf8_encode is a specialized case of character set conversions. If your string to be converted to utf-8 is something other than iso-8859-1 (such as iso-8859-2 (Polish/Croatian)), you should use recode_string() or iconv() instead rather than trying to devise complex str_replace statements.

penda ekoka

creating utf-8 xml files: this is something that has wasted a lot of my time, I hope this will spare you the headaches: my method consists of creating an xml template that will look like this (this is probably optional, I'm sure you can use good ol' print or echo statements): xml_tpl.php <?php header("Content-Type: text/html;charset=ISO-8859-1"); print "<?xml version=\"1.0\" encoding=\"UTF-8\" ?>\n"; $names=array('jack','bob','vanessa','catherine','valerie'); ?> <parent> <?php foreach($names as $name) {?> <child name="<?php print $name?>" /> <?php } ?> </parent> ?> from a function or a method I include the previous template and trap the outputted content in an output buffer. The buffured content is then inserted into a file: <?php function create_xml(){ ob_start(); include "xml_php.php"; $trapped_content=ob_get_contents(); ob_end_clean(); $file_path= "./somefile.xml"; $file_handle=fopen($somefile,'w'); fwrite($file_handle,utf8_encode($trapped_content)); } ?> Some side notes: - note that the utf8_encode function goes inside the fwrite() function. - when troubleshooting, make sure to transfer text file (xml included) and scripts in ascii mode when using ftp. For some unknown reason my ftp client did not have xml set as an ascii transfer candidate and was automatically tranfering them in binary. That little "feature" ended up costing me hours of frustration, as the encoding information would just "vanish" between transfer and I kept scratching my head as to why manually created utf8 files were not behaving as they should.

rbotzer

BTW, the 21-bit range is pretty old news. Unicode 3.x uses a 31bit encoding scheme that allows for 2 billion characters. I'll post an enhanced encoder soon. In the meanwhile here's the current encoding scheme: http://www.cl.cam.ac.uk/~mgk25/unicode.html#utf-8 Ronen

anonymous

A few bugs in your example code: function code2utf($num){ if($num<128)return chr($num); if($num<2048)return chr(($num>>6)+192).chr(($num&63)+128); if($num<65536)return chr(($num>>12)+224).chr((($num>>6)&63)+128).chr(($num&63)+128); if($num<2097152)return chr(($num>>18)+240).chr((($num>>12)&63)+128).chr((($num>>6)&63)+128) .chr(($num&63)+128); return ''; }

hrpeters

// Validate Unicode UTF-8 Version 4 // This function takes as reference the table 3.6 found at http://www.unicode.org/versions/Unicode4.0.0/ch03.pdf // It also flags overlong bytes as error function is_validUTF8($str) { // values of -1 represent disalloweded values for the first bytes in current UTF-8 static $trailing_bytes = array ( 0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0, 0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0, 0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0, 0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0, 0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0, 0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0, 0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0, 0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0, -1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1, -1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1, -1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1, -1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1, -1,-1,1,1,1,1,1,1,1,1,1,1,1,1,1,1, 1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1, 2,2,2,2,2,2,2,2,2,2,2,2,2,2,2,2, 3,3,3,3,3,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1 ); $ups = unpack('C*', $str); if (!($aCnt = count($ups))) return true; // Empty string *is* valid UTF-8 for ($i = 1; $i <= $aCnt;) { if (!($tbytes = $trailing_bytes[($b1 = $ups[$i++])])) continue; if ($tbytes == -1) return false; $first = true; while ($tbytes > 0 && $i <= $aCnt) { $cbyte = $ups[$i++]; if (($cbyte & 0xC0) != 0x80) return false; if ($first) { switch ($b1) { case 0xE0: if ($cbyte < 0xA0) return false; break; case 0xED: if ($cbyte > 0x9F) return false; break; case 0xF0: if ($cbyte < 0x90) return false; break; case 0xF4: if ($cbyte > 0x8F) return false; break; default: break; } $first = false; } $tbytes--; } if ($tbytes) return false; // incomplete sequence at EOS } return true; }

04-nov-2005 10:34

// Reads a file story.txt ascii (as typed on keyboard) // converts it to Georgian character using utf8 encoding // if I am correct(?) just as it should be when typed on Georgian computer // it outputs it as an html file // // http://www.comweb.nl/keys_to_georgian.html // http://www.comweb.nl/keys_to_georgian.php // http://www.comweb.nl/story.txt <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd"> <HTML> <HEAD> <TITLE>keys to unicode code</TITLE> // this meta tag is needed <meta http-equiv="Content-Type" content="text/html;charset=utf-8" > // note the sylfean font seems to be standard installed on Windows XP // It supports Georgian <style TYPE="text/css">  </style> </HEAD> <BODY> <? $eng=array(97,98,99,100,101,102,103,104,105,106,107,108,109,110,111, 112,113,114,115,116,117,118,119,120,121,122,87,82,84,83, 67,74,90); $geo=array(4304,4305,4330,4307,4308,4324,4306,4336,4312,4335,4313, 4314,4315,4316,4317,4318,4325,4320,4321,4322,4323,4309, 4332,4334,4327,4310,4333,4326,4311,4328,4329,4319,4331, 91,93,59,39,44,46,96); $fc=file("story.txt"); foreach($fc as $line) { $spacestart=1; for ($i=0; $i<strlen($line); $i+=1) { $character=ord(substr($line,$i,1)); $found=0; for ($k=0; $k<count($eng); $k+=1) { if ($eng[$k]==$character) { print code2utf( $geo[$k] ); $found=1; } } if ($found==0) { if ($character==126 || $character==32 || $character==10 || $character==9) { if ($character==9) { print '     '; } if ($character==10) { print " \n"; } if ($character==32) { if ($spacestart==1) {print ' '; } else { print " "; } } if ($character==126){ print "~"; } } else { print substr($line,$i,1); } } if ($character!=32) { $spacestart=0; } } } /** * Function coverts number of utf char into that character. * Function taken from: http://sk2.php.net/manual/en/function.utf8-encode.php#49336 * * @param int $num * @return utf8char */ function code2utf($num) { if($num<128)return chr($num); if($num<2048)return chr(($num>>6)+192).chr(($num&63)+128); if($num<65536)return chr(($num>>12)+224).chr((($num>>6)&63)+128).chr(($num&63)+128); if($num<2097152)return chr(($num>>18)+240).chr((($num>>12)&63)+128).chr((($num>>6)&63)+128) .chr(($num&63)+128); return ''; } ?> </BODY> </HTML>

sunish_mv

/*Here I have a class that will convert ISCII (Indian Standard Code for Information Interchange) devnagiri (Hindi) string to unicode string. /* <?php class iscii2utf8 { var $map; function iscii2utf8() { $this->map = array ( "a0" => '63' , "a1" => '2305' , "a2" => '2306' , "a3" => '2307' , "a4" => '2309' , "a5" => '2310' , "a6" => '2311' , "a7" => '2312' , "a8" => '2313' , "a9" => '2314' , "aa" => '2315' , "ab" => '2318' , "ac" => '2319' , "ad" => '2320' , "ae" => '2317' , "af" => '2322' , "b0" => '2323' , "b1" => '2324' , "b2" => '2321' , "b3" => '2325' , "b4" => '2326' , "b5" => '2327' , "b6" => '2328' , "b7" => '2329' , "b8" => '2330' , "b9" => '2331' , "ba" => '2332' , "bb" => '2333' , "bc" => '2334' , "bd" => '2335' , "be" => '2336' , "bf" => '2337' , "c0" => '2338' , "c1" => '2339' , "c2" => '2340' , "c3" => '2341' , "c4" => '2342' , "c5" => '2343' , "c6" => '2344' , "c7" => '2345' , "c8" => '2346' , "c9" => '2347' , "ca" => '2348' , "cb" => '2349' , "cc" => '2350' , "cd" => '2351' , "ce" => '2399' , "cf" => '2352' , "d0" => '2353' , "d1" => '2354' , "d2" => '2355' , "d3" => '2356' , "d4" => '2357' , "d5" => '2358' , "d6" => '2359' , "d7" => '2360' , "d8" => '2361' , "d9" => '63' , "da" => '2366' , "db" => '2367' , "dc" => '2368' , "dd" => '2369' , "de" => '2370' , "df" => '2371' , "e0" => '2374' , "e1" => '2375' , "e2" => '2376' , "e3" => '2373' , "e4" => '2378' , "e5" => '2379' , "e6" => '2380' , "e7" => '2377' , "e8" => '2381' , "e9" => '63' , "ea" => '2404' , "eb" => '63' , "ec" => '63' , "ed" => '63' , "ee" => '63' , "ef" => '63' , "f0" => '63' , "f1" => '2406' , "f2" => '2407' , "f3" => '2408' , "f4" => '2409' , "f5" => '2410' , "f6" => '2411' , "f7" => '2412' , "f8" => '2413' , "f9" => '2414' , "fa" => '2415' , "fb" => '63' , "fc" => '63' , "fd" => '63' , "fe" => '63' , "ff" => '63' ,); } function code2utf($num){ //Returns the utf string corresponding to the unicode value //courtesy - romans@void.lv if($num<128)return chr($num); if($num<1024)return chr(($num>>6)+192).chr(($num&63)+128); if($num<32768)return chr(($num>>12)+224).chr((($num>>6)&63)+128).chr(($num&63)+128); if($num<2097152)return chr($num>>18+240).chr((($num>>12)&63)+128).chr(($num>>6)&63+128). chr($num&63+128); return ''; } function convertstring($iscii) { //Returs utf8 string equibalent of given iscii string $str = ""; for($i = 0; $i<strlen($iscii); $i++) { $c = dechex(ord(substr($iscii,$i,1))); if (isset($this->map[$c] )) { $s = $this->code2utf($this->map[$c]); $str .= ($s == "?")?"":$s; } else { $str .= substr($iscii,$i,1); } } return $str; } } ?>

emze

/* Every function seen so far is incomplete or resource consumpting. Here are two -- integer 2 utf sequence (i3u) and utf sequence to integer (u3i). Below is a code snippet that checks well behavior at the range boundaries. Someday they might be hardcoded into PHP... */ function i3u($i) { // returns UCS-16 or UCS-32 to UTF-8 from an integer $i=(int)$i; // integer? if ($i<0) return false; // positive? if ($i<=0x7f) return chr($i); // range 0 if (($i & 0x7fffffff) <> $i) return '?'; // 31 bit? if ($i<=0x7ff) return chr(0xc0 | ($i >> 6)) . chr(0x80 | ($i & 0x3f)); if ($i<=0xffff) return chr(0xe0 | ($i >> 12)) . chr(0x80 | ($i >> 6) & 0x3f) . chr(0x80 | $i & 0x3f); if ($i<=0x1fffff) return chr(0xf0 | ($i >> 18)) . chr(0x80 | ($i >> 12) & 0x3f) . chr(0x80 | ($i >> 6) & 0x3f) . chr(0x80 | $i & 0x3f); if ($i<=0x3ffffff) return chr(0xf8 | ($i >> 24)) . chr(0x80 | ($i >> 18) & 0x3f) . chr(0x80 | ($i >> 12) & 0x3f) . chr(0x80 | ($i >> 6) & 0x3f) . chr(0x80 | $i & 0x3f); return chr(0xfc | ($i >> 30)) . chr(0x80 | ($i >> 24) & 0x3f) . chr(0x80 | ($i >> 18) & 0x3f) . chr(0x80 | ($i >> 12) & 0x3f) . chr(0x80 | ($i >> 6) & 0x3f) . chr(0x80 | $i & 0x3f); } function u3i($s,$strict=1) { // returns integer on valid UTF-8 seq, NULL on empty, else FALSE // NOT strict: takes only DATA bits, present or not; strict: length and bits checking if ($s=='') return NULL; $l=strlen($s); $o=ord($s{0}); if ($o <= 0x7f && $l==1) return $o; if ($l>6 && $strict) return false; if ($strict) for ($i=1;$i<$l;$i++) if (ord($s{$i}) > 0xbf || ord($s{$i})< 0x80) return false; if ($o < 0xc2) return false; // no-go even if strict=0 if ($o <= 0xdf && ($l=2 && $strict)) return (($o & 0x1f) << 6 | (ord($s{1}) & 0x3f)); if ($o <= 0xef && ($l=3 && $strict)) return (($o & 0x0f) << 12 | (ord($s{1}) & 0x3f) << 6 | (ord($s{2}) & 0x3f)); if ($o <= 0xf7 && ($l=4 && $strict)) return (($o & 0x07) << 18 | (ord($s{1}) & 0x3f) << 12 | (ord($s{2}) & 0x3f) << 6 | (ord($s{3}) & 0x3f)); if ($o <= 0xfb && ($l=5 && $strict)) return (($o & 0x03) << 24 | (ord($s{1}) & 0x3f) << 18 | (ord($s{2}) & 0x3f) << 12 | (ord($s{3}) & 0x3f) << 6 | (ord($s{4}) & 0x3f)); if ($o <= 0xfd && ($l=6 && $strict)) return (($o & 0x01) << 30 | (ord($s{1}) & 0x3f) << 24 | (ord($s{2}) & 0x3f) << 18 | (ord($s{3}) & 0x3f) << 12 | (ord($s{4}) & 0x3f) << 6 | (ord($s{5}) & 0x3f)); return false; } // boundary behavior checking $do=array(0x7f,0x7ff,0xffff,0x1fffff,0x3ffffff,0x7fffffff); foreach ($do as $ii) for ($i=$ii;$i<=$ii+1; $i++) { $o=i3u($i); for ($j=0;$j<strlen($o);$j++) print "O[$j]=" . sprintf('%08b',ord($o{$j})) . ", "; print "c=$i, o=[$o].\n"; print "Back: [$o] => [" . u3i($o) . "]\n"; }

28-mar-2007 11:07

<?php function unicon($str, $to_uni = true) { $cp = Array ( "Ð" => "А", "Ð°" => "а", "Ð‘" => "Б", "Ð±" => "б", "Ð’" => "В", "Ð²" => "в", "Ð“" => "Г", "Ð³" => "г", "Ð”" => "Д", "Ð´" => "д", "Ð•" => "Е", "Ðµ" => "е", "Ð" => "Ё", "Ñ‘" => "ё", "Ð–" => "Ж", "Ð¶" => "ж", "Ð—" => "З", "Ð·" => "з", "Ð˜" => "И", "Ð¸" => "и", "Ð™" => "Й", "Ð¹" => "й", "Ðš" => "К", "Ðº" => "к", "Ð›" => "Л", "Ð»" => "л", "Ðœ" => "М", "Ð¼" => "м", "Ð" => "Н", "Ð½" => "н", "Ðž" => "О", "Ð¾" => "о", "ÐŸ" => "П", "Ð¿" => "п", "Ð " => "Р", "Ñ€" => "р", "Ð¡" => "С", "Ñ" => "с", "Ð¢" => "Т", "Ñ‚" => "т", "Ð£" => "У", "Ñƒ" => "у", "Ð¤" => "Ф", "Ñ„" => "ф", "Ð¥" => "Х", "Ñ…" => "х", "Ð¦" => "Ц", "Ñ†" => "ц", "Ð§" => "Ч", "Ñ‡" => "ч", "Ð¨" => "Ш", "Ñˆ" => "ш", "Ð©" => "Щ", "Ñ‰" => "щ", "Ðª" => "Ъ", "ÑŠ" => "ъ", "Ð«" => "Ы", "Ñ‹" => "ы", "Ð¬" => "Ь", "ÑŒ" => "ь", "Ð" => "Э", "Ñ" => "э", "Ð®" => "Ю", "ÑŽ" => "ю", "Ð¯" => "Я", "Ñ" => "я" ); if ($to_uni) { $str = strtr($str, $cp); } else { foreach ($cp as $c) { $cpp[$c] = array_search($c, $cp); } $str = strtr($str, $cpp); } return $str; } ?>

utf8_encode

Encodes an ISO-8859-1 string to UTF-8 (PHP 4, PHP 5)
string utf8_encode ( string data )

Related Examples ( Source code ) » utf8_encode

Code Examples / Notes » utf8_encode

Change Language

utf8_encode

Encodes an ISO-8859-1 string to UTF-8 (PHP 4, PHP 5) string utf8_encode ( string data )

Related Examples ( Source code ) » utf8_encode

Code Examples / Notes » utf8_encode

Change Language

Encodes an ISO-8859-1 string to UTF-8 (PHP 4, PHP 5)
string utf8_encode ( string data )