python Programming Glossary: surrogate
Django or similar for composite primary keys http://stackoverflow.com/questions/1624257/django-or-similar-for-composite-primary-keys share improve this question A work around is to create a surrogate key an auto increment column as the primary key column and place..
What is internal representation of string in Python 3.x http://stackoverflow.com/questions/1838170/what-is-internal-representation-of-string-in-python-3-x oriented EXTERNAL representation. Each code unit character surrogate etc has been assigned a number from range 0 2 21 . This is called.. Multilingual Plane BMP unless you want to muck about with surrogates handy if you can't find your hair shirt and your bed of nails..
What is the best approach to change primary keys in an existing Django app? http://stackoverflow.com/questions/2055784/what-is-the-best-approach-to-change-primary-keys-in-an-existing-django-app probably wrong. The formal primary key should always be a surrogate key. Never anything else. Strong words. Been database designer.. key that can be taken as primary. It isn't primary. Only surrogates can be primary. You're doing open heart surgery. Don't mess..
Find all Chinese text in a string using Python and Regex http://stackoverflow.com/questions/2718196/find-all-chinese-text-in-a-string-using-python-and-regex which can only be represented in narrow Unicode builds via surrogate pairs RE re.compile u' ⺀ ⺙â ⻳â ⿕々〇〠©ã€ ºã€»ã 䶵ä 鿃è 鶴侮 »ä¸¦.. # A narrow python build so can't use chars 65535 without surrogate pairs else try L.append unichr i except pass RE ' s ' ''.join..
How to correct bugs in this Damerau-Levenshtein implementation? http://stackoverflow.com/questions/3431933/how-to-correct-bugs-in-this-damerau-levenshtein-implementation TO CHARACTER IDs CIDs # cdef unsigned int _UMX_surrogate_lower_bound 0x10000 cdef unsigned int _UMX_surrogate_upper_bound.. _UMX_surrogate_lower_bound 0x10000 cdef unsigned int _UMX_surrogate_upper_bound 0x10ffff cdef unsigned int _UMX_surrogate_hi_lower_bound.. _UMX_surrogate_upper_bound 0x10ffff cdef unsigned int _UMX_surrogate_hi_lower_bound 0xd800 cdef unsigned int _UMX_surrogate_hi_upper_bound..
How to do a Python split() on languages (like Chinese) that don't use whitespace as word separator? http://stackoverflow.com/questions/3797746/how-to-do-a-python-split-on-languages-like-chinese-that-dont-use-whitespace on the assumption that unicode was still 16 bits so called surrogate pairs were devised where two 16 bit codepoints from specifically.. and encoding. now while narrow CPython deals with surrogate pairs quite transparently in some cases it will still fail to.. ud85f' ' udc3c' 'd' 'e' 'f' with ' ud85f' ' udc3c' being a surrogate pair. incidentally ' ud85f udc3c' is what the JSON standard..
Perl Compatible Regular Expression (PCRE) in Python http://stackoverflow.com/questions/7063420/perl-compatible-regular-expression-pcre-in-python returns 1 for non BMP characters so len ' U0010FFFF' 1 surrogate pairs are not recombined in string literals so ' uDBFF uDFFF'..
|