UTF-16 (16-bit Unicode Transformation Format)

UTF-16 (16-bit Unicode Transformation Format) is a standard method of encoding Unicode character data. It is part of the Unicode Standard, version 3.0 and later, and has the capacity to encode all currently defined Unicode characters. UTF-16 is specified in Annex Q of the ISO/IEC 10646 standard and in IETF RFC 2781.

Unicode is designed to accommodate all of the world's known writing systems. It assigns each character a number called a code point, and currently employs three different encodings to represent those code points: UTF-8, UTF-16 and UTF-32. Each encoding defines how a code point is represented in binary form, for example in a file. The Unicode code space runs from 0 through 10FFFF in hexadecimal, which gives 1,114,112 code points in decimal. Unicode code points are divided into 17 planes of 65,536 code points each, of which Planes 0 through 2 are the most commonly used:

  • Plane 0, known as the Basic Multilingual Plane (BMP), contains characters for almost all modern languages as well as most common special characters.
  • Plane 1, known as the Supplementary Multilingual Plane (SMP), is used primarily for historic scripts such as Linear B and for musical and mathematical symbols.
  • Plane 2, known as the Supplementary Ideographic Plane (SIP), is used for about 40,000 Unified Han Ideographs seldom used in daily written communications.
The remaining planes are, as yet, largely unused.
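
As a rough illustration of how code points map to planes, the following Python sketch computes the plane number from a code point. The function name `plane_of` and the constant names are made up for this example; the only facts it relies on are that the code space ends at 10FFFF hexadecimal and that each plane holds 65,536 code points.

```python
# Illustrative sketch: names here are hypothetical, not part of any standard API.
MAX_CODE_POINT = 0x10FFFF      # last code point in the Unicode code space
PLANE_SIZE = 0x10000           # 65,536 code points per plane


def plane_of(code_point: int) -> int:
    """Return the plane number (0 through 16) that contains code_point."""
    if not 0 <= code_point <= MAX_CODE_POINT:
        raise ValueError("not a Unicode code point: %r" % (code_point,))
    # Each plane spans 0x10000 code points, so the plane is the value
    # of the bits above the low 16 bits.
    return code_point >> 16


TOTAL_CODE_POINTS = MAX_CODE_POINT + 1
NUMBER_OF_PLANES = TOTAL_CODE_POINTS // PLANE_SIZE

if __name__ == "__main__":
    print(TOTAL_CODE_POINTS)     # size of the code space
    print(NUMBER_OF_PLANES)      # number of planes
    print(plane_of(ord("A")))    # Latin capital A is in the BMP
    print(plane_of(0x1D11E))     # MUSICAL SYMBOL G CLEF is in the SMP
    print(plane_of(0x20000))     # first code point of the SIP
```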

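UTF-16 encodes each code point as either one or two 16-bit units. Code points in the BMP take a single 16-bit unit; code points in the supplementary planes take a pair of 16-bit units known as a surrogate pair. Because the two bytes of a 16-bit unit can be written to an 8-bit (octet) sequence in either order, there are three different encoding schemes around the basic 16-bit model: UTF-16BE (big-endian), UTF-16LE (little-endian) and UTF-16, in which the byte order may be signaled by a byte order mark (BOM) at the start of the data.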
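
The one-or-two-unit rule can be sketched in a few lines of Python. This is a minimal illustration that follows the surrogate arithmetic described in RFC 2781, not a replacement for a real codec; the function name `utf16_code_units` is hypothetical. The usage lines then compare the result with Python's built-in `utf-16-be` and `utf-16-le` codecs to show how the same 16-bit units are written to bytes in each order.

```python
from typing import List


def utf16_code_units(code_point: int) -> List[int]:
    """Return the 16-bit units UTF-16 uses for one code point (illustrative helper)."""
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("not a Unicode code point")
    if 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("surrogate code points cannot be encoded on their own")
    if code_point < 0x10000:
        # BMP: the code point is stored directly in a single 16-bit unit.
        return [code_point]
    # Supplementary planes: subtract 0x10000 to get a 20-bit value,
    # then split it into a high (top 10 bits) and low (bottom 10 bits) surrogate.
    offset = code_point - 0x10000
    high = 0xD800 + (offset >> 10)
    low = 0xDC00 + (offset & 0x3FF)
    return [high, low]


if __name__ == "__main__":
    clef = "\U0001D11E"  # MUSICAL SYMBOL G CLEF, a Plane 1 character

    print([hex(u) for u in utf16_code_units(ord("A"))])   # one unit
    print([hex(u) for u in utf16_code_units(ord(clef))])  # two units

    # Same 16-bit units, serialized to octets in the two byte orders.
    print(clef.encode("utf-16-be").hex())
    print(clef.encode("utf-16-le").hex())

    # The plain "utf-16" codec prepends a byte order mark (BOM).
    print(clef.encode("utf-16")[:2].hex())
```

In Python, the plain `utf-16` codec writes the BOM in the machine's native byte order, so the last line can differ between platforms.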

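UTF-16 is sometimes used interchangeably with UCS-2, although such use is not strictly correct: UCS-2 is a fixed-width 16-bit encoding that can represent only the characters in the BMP, whereas UTF-16 can also represent characters in the supplementary planes.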
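
To make the distinction concrete, the following Python sketch checks whether a string would fit in a strict one-unit-per-character encoding such as UCS-2, which has no way to represent code points above FFFF hexadecimal, and counts how many 16-bit units UTF-16 needs for the same string. Both function names are invented for this example.

```python
def fits_in_ucs2(text: str) -> bool:
    """True if every character is a BMP code point (illustrative helper)."""
    return all(ord(ch) <= 0xFFFF for ch in text)


def utf16_length(text: str) -> int:
    """Number of 16-bit units in the UTF-16 form of text (illustrative helper)."""
    # Every UTF-16 unit is exactly two bytes, so halve the encoded length.
    return len(text.encode("utf-16-le")) // 2


if __name__ == "__main__":
    bmp_only = "Unicode"
    with_supplementary = "A\U0001D11E"  # "A" plus a Plane 1 character

    print(fits_in_ucs2(bmp_only), utf16_length(bmp_only))
    print(fits_in_ucs2(with_supplementary), utf16_length(with_supplementary))
```

For text drawn entirely from the BMP the two encodings produce the same 16-bit units, which is one reason the names are so often confused.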

This was last updated in March 2011
