Tools to help with ASCII in UTF-8
Replace accented UTF-8 characters with unaccented ASCII-7 "equivalents".
The purpose of this function is to replace accented characters commonly found in Latin alphabets with something more or less equivalent from the ASCII range. This can be useful for converting a UTF-8 string to something suitable for a filename, for example. After using this function, you would probably also pass the string through utf8_strip_non_ascii to clean out any other non-ASCII characters. Use the optional parameter to deaccent only lower case ($case = -1) or only upper case ($case = 1) letters. The default is to deaccent both cases ($case = 0).
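As a rough illustration of the $case behaviour described above, here is a minimal sketch. The function name sketch_accents_to_ascii and its tiny lookup tables are hypothetical and are not the library's own implementation; a real table would cover far more letters. The source file is assumed to be saved as UTF-8.

```php
<?php
// Hypothetical sketch, not the library's implementation.
// Maps a handful of accented letters to ASCII look-alikes.
function sketch_accents_to_ascii($str, $case = 0)
{
    // Deliberately small sample tables (assumption: UTF-8 source file).
    $lower = array('à' => 'a', 'é' => 'e', 'ü' => 'u', 'ñ' => 'n', 'ç' => 'c');
    $upper = array('À' => 'A', 'É' => 'E', 'Ü' => 'U', 'Ñ' => 'N', 'Ç' => 'C');

    // $case = -1: lower case only; $case = 1: upper case only; $case = 0: both.
    if ($case <= 0) {
        $str = strtr($str, $lower);
    }
    if ($case >= 0) {
        $str = strtr($str, $upper);
    }
    return $str;
}

$both      = sketch_accents_to_ascii('Élan café');      // 'Elan cafe'
$lowerOnly = sketch_accents_to_ascii('Élan café', -1);  // 'Élan cafe'
$upperOnly = sketch_accents_to_ascii('Élan café', 1);   // 'Elan café'
```

strtr() with an array argument matches whole multi-byte keys, so it can be applied to UTF-8 strings without any multibyte extension.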
Tests whether a string contains only 7-bit ASCII bytes.
You might use this to check whether a string needs handling as UTF-8 at all. If it is plain ASCII, the native PHP string function can be used instead, which may be faster. For example:
<?php
if (utf8_is_ascii($someString)) {
    // It's just ASCII - use the native PHP version
    $someString = strtolower($someString);
} else {
    $someString = utf8_strtolower($someString);
}
?>
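One plausible way to perform such a test is a byte-level regular expression that looks for any byte outside the range 0x00-0x7F. The sketch below uses a hypothetical name, sketch_is_ascii, and is an assumption about how a check like this could be written, not the library's actual code.

```php
<?php
// Hypothetical sketch, not the library's implementation.
// Returns true when no byte above 0x7F occurs in the string.
function sketch_is_ascii($str)
{
    // No /u modifier: the pattern is matched byte by byte.
    return preg_match('/[^\x00-\x7F]/', $str) !== 1;
}

$plain    = sketch_is_ascii('hello');   // true
$accented = sketch_is_ascii('héllo');   // false: 'é' is two bytes >= 0x80 in UTF-8
```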
Tests whether a string contains only 7-bit ASCII bytes, excluding the device control codes. The device control codes are listed in the second table at http://www.w3schools.com/tags/ref_ascii.asp
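A stricter check of this kind might be sketched as follows. Exactly which codes count as "device control codes" is an assumption here: this sketch accepts printable ASCII (0x20-0x7E) plus tab, line feed and carriage return, and rejects everything else. The name sketch_is_ascii_ctrl is hypothetical.

```php
<?php
// Hypothetical sketch, not the library's implementation.
// Assumption: tab (0x09), LF (0x0A) and CR (0x0D) are allowed;
// all other control codes and all bytes above 0x7E are rejected.
function sketch_is_ascii_ctrl($str)
{
    return preg_match('/[^\x09\x0A\x0D\x20-\x7E]/', $str) !== 1;
}

$text = sketch_is_ascii_ctrl("line one\nline two");  // true
$bell = sketch_is_ascii_ctrl("beep\x07");            // false: 0x07 (BEL) is a control code
```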
Strip out all non-7-bit ASCII bytes. If you need to transmit a string to a system which you know can only support 7-bit ASCII, you could use this function.
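Stripping can be sketched with the same byte range: remove every run of bytes above 0x7F. The name sketch_strip_non_ascii is hypothetical and the code is only an illustration of the idea, not the library's implementation. Note that the accented letter is dropped entirely rather than replaced.

```php
<?php
// Hypothetical sketch, not the library's implementation.
// Deletes every byte outside 0x00-0x7F.
function sketch_strip_non_ascii($str)
{
    // No /u modifier: multi-byte UTF-8 sequences are removed byte by byte.
    return preg_replace('/[^\x00-\x7F]+/', '', $str);
}

$stripped = sketch_strip_non_ascii('naïve');  // 'nave'
```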
Strip out all non-7-bit ASCII bytes and ASCII device control codes.
For a list of ASCII device control codes, see the second table at http://www.w3schools.com/tags/ref_ascii.asp
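The control-code variant might look like the sketch below. As an assumption, tab, line feed and carriage return are kept and every other control code is removed along with the non-ASCII bytes; the library may draw that line differently. The name sketch_strip_non_ascii_ctrl is hypothetical.

```php
<?php
// Hypothetical sketch, not the library's implementation.
// Assumption: keeps printable ASCII (0x20-0x7E) plus tab, LF and CR.
function sketch_strip_non_ascii_ctrl($str)
{
    return preg_replace('/[^\x09\x0A\x0D\x20-\x7E]+/', '', $str);
}

// 0x07 (BEL) and the two bytes of 'é' are removed; the tab is kept.
$clean = sketch_strip_non_ascii_ctrl("a\x07b\tc\xC3\xA9");  // "ab\tc"
```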
Documentation generated on Fri, 17 Mar 2006 00:28:07 +0100 by phpDocumentor 1.3.0RC4