Fixed-width strings: Difference between revisions

Fixed-width strings (view source)

Revision as of 17:03, 2 May 2008

487 bytes added , 2 May 2008

m

→‎String creation

Daumling

55

edits

@@ Line 17: / Line 17: @@
 === String creation ===
-Strings may either be created with 8, 16, or 32 bit data. In addition, string may be created with UTF-8 data, which results in the smallest width that can hold the data.
+Strings may either be created with 8, 16, or 32 bit data. In addition, strings may be created with UTF-8 data, which results in the smallest width that can hold the data.
-String are created using static creator function. This allows the implementation to use raw memory allocation and in-place constructor calls to avoid having to do two memory allocations, one for the instance, and the other for the data. Strings created that way contain the data right behind the instance data.
+String are created using static creator functions. This allows the implementation to use raw memory allocation and in-place constructor calls to avoid having to do two memory allocations, one for the instance, and the other for the data. Strings created that way contain the data right behind the instance data.
 The maximum string width determines the way strings are created. It is an optional argument to the string constructors.
 # 8 bits: If the source data contains 16 or 32 bit data, the return value is null.
 #16 bits: If the source data contains 32 bit values, surrogate pairs are created. If a character is > 0x10FFFF, null is returned.
+This allows implementers to define the maximum width of strings; they can choose to use 8, 16 or 32 bits throughout, or they can choose to go with whatever width that fits best. If they choose best-fit widths, string creation methods do not create UTF-16 surrogate pairs. If a script creates surrogate pairs, these will remain in strings, though, although a flattening operation could detect surrogate pairs and widen the flattened string to 32 bits. This should be a global setting.
 ''Question: How are out-of-memory conditions handled? The current implementation often just assumes success. There should be some sort of exception, and the same mechanism should be used to report strings that cannot be created.''