How to Encode & Scan UTF-8 Unicode Characters

UTF-8 is a method of encoding Unicode characters, such as Arabic, Greek, Thai, Chinese, Korean, or Ukrainian characters, as a sequence of bytes. These characters usually cannot be encoded directly in 2D barcodes such as QR Code; the text must first be converted to bytes. Most IDAutomation QR Code and Data Matrix products include a built-in method to automatically encode UTF-8. Refer to the UTF-8 Unicode Encoding FAQ for more information.
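The conversion to bytes can be illustrated with a minimal sketch in plain Python. This is not the IDAutomation built-in method; the helper names to_utf8_bytes and from_utf8_bytes are hypothetical and exist only for this example. It shows the general idea: text is turned into UTF-8 bytes before it is placed in the barcode, and the scanned bytes are decoded back into text afterwards.

```python
def to_utf8_bytes(text: str) -> bytes:
    """Convert Unicode text to the UTF-8 bytes a barcode would carry."""
    return text.encode("utf-8")


def from_utf8_bytes(payload: bytes) -> str:
    """Convert scanned UTF-8 bytes back into Unicode text."""
    return payload.decode("utf-8")


sample = "Привіт"  # Ukrainian sample text
payload = to_utf8_bytes(sample)

# Each Cyrillic letter here takes two bytes in UTF-8.
print(len(sample), len(payload))  # 6 characters, 12 bytes

# Decoding the bytes restores the original text.
print(from_utf8_bytes(payload) == sample)  # True
```

Both the encoding side and the scanning side must agree that the bytes are UTF-8; if the scanner or receiving application interprets them with a different encoding, the text will appear garbled.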

UTF-8 was designed to be backward compatible with ASCII. The first 128 characters of Unicode, which correspond one-to-one with ASCII, are encoded using a single byte with the same binary value as in ASCII, so valid ASCII text is also valid UTF-8-encoded Unicode. As of 2023, UTF-8 is the dominant encoding on the Internet.
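The backward compatibility described above can be checked with a short Python sketch. The function name utf8_lengths is hypothetical, chosen for this example; it reports how many bytes UTF-8 uses for each character in a string.

```python
def utf8_lengths(text: str) -> list[int]:
    """Return the number of UTF-8 bytes used by each character."""
    return [len(ch.encode("utf-8")) for ch in text]


# ASCII text produces identical bytes under ASCII and UTF-8.
ascii_text = "QR Code 123"
same_bytes = ascii_text.encode("utf-8") == ascii_text.encode("ascii")
print(same_bytes)  # True

# Every one of the first 128 code points is a single byte with the same value.
all_single = all(chr(i).encode("utf-8") == bytes([i]) for i in range(128))
print(all_single)  # True

# Characters outside ASCII take more than one byte:
# Latin A, Greek lambda, Thai sara ai maimalai, Chinese zhong.
print(utf8_lengths("Aλไ中"))  # [1, 2, 3, 3]
```

One practical consequence is that data containing only ASCII characters takes up the same space in a barcode whether or not UTF-8 encoding is enabled, while other characters consume two or more bytes each.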