DIY: UTF-8 Validation
Explore how to validate UTF-8 encoding by implementing a function that checks whether an array of integers forms valid UTF-8 data. Understand the bit-level rules of UTF-8 characters and learn to apply them in Go for cybersecurity problem solving.
We'll cover the following...
Problem description
Given an integer array data, return whether it is a valid UTF-8 encoding.
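Because the rules in this lesson are stated in terms of individual bits, it can help to view each integer in data as an 8-bit pattern. The following is a minimal sketch in Go, not a full validator. It assumes each element carries one byte in its low 8 bits, and the helper names bitPattern and leadingBitIsZero are hypothetical, introduced only for illustration. It checks only the 1-byte rule, which requires the first bit to be 0.

```go
package main

import "fmt"

// bitPattern renders the low 8 bits of v as a zero-padded binary string.
// Assumption: only the low 8 bits of each integer represent data.
func bitPattern(v int) string {
	return fmt.Sprintf("%08b", v&0xFF)
}

// leadingBitIsZero reports whether the highest of the low 8 bits is 0,
// which is the condition the 1-byte rule places on a byte.
// Hypothetical helper, not part of any library.
func leadingBitIsZero(v int) bool {
	return v&0x80 == 0
}

func main() {
	// Sample input chosen for illustration only.
	data := []int{72, 105, 197}
	for _, v := range data {
		fmt.Printf("%3d -> %s  leading bit is 0: %t\n",
			v, bitPattern(v), leadingBitIsZero(v))
	}
}
```

Masking with 0x80 isolates the top bit of the byte, so a single comparison is enough to tell a 1-byte character apart from a byte that belongs to a longer sequence.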
A character in UTF-8 can be from 1 to 4 bytes long, subject to the following rules:
- For a 1-byte character, the first bit of the packet is 0, followed by its Unicode code.
- For an n-bytes character, the first