exactly, that's the standard.
It is not correct to say unicode is 2 bytes. (maybe once it was)
Unicode gives a number to every symbol. Then there are several ways to pass the numbers as bytes. The one MarkV says it is called UTF-16.
In URLs instead UTF-8 is used.