Hi Gilles,

 

Thanks for pointing out the whitelist I was not aware of. With this information I found the GOOD_CHARACTERS list in tests/scripts/check_files.py. You might want to consider a short info about UTF-8 usage in https://mbed-tls.readthedocs.io/en/latest/kb/development/mbedtls-coding-standards/, but I can create an issue for that, if you prefer.

 

Thanks

Stephan

 

 

From: Gilles Peskine <gilles.peskine@arm.com>
Date: Sunday, 12 May 2024 at 20:33
To: Minos Galanakis <Minos.Galanakis@arm.com>, mbed-tls@lists.trustedfirmware.org <mbed-tls@lists.trustedfirmware.org>, Stephan Koch <koch@oberon.ch>
Subject: Re: [mbed-tls] Re: UTF-8 in code files

What's wrong with that?

It's not the first non-ASCII character in the source code, not even the first in a public header.

We have a whitelist of Unicode characters that includes diacritics over Latin characters for bibliographical references and some common punctuation and mathematical characters.

Best regards,

--
Gilles Peskine
Mbed TLS developer

On 09/05/2024 16:27, Minos Galanakis via mbed-tls wrote:

Hi Stephan,

 

Thank you for spotting that. That is not intended and will be looked into.

 

Minos


From: Stephan Koch via mbed-tls <mbed-tls@lists.trustedfirmware.org>
Sent: 08 May 2024 12:44
To: mbed-tls@lists.trustedfirmware.org <mbed-tls@lists.trustedfirmware.org>
Subject: [mbed-tls] UTF-8 in code files

 

Hello Mbed TLS team,

 

Mbed TLS 3.6 introduced the first UTF-8 characters “±” in source code, see mbedtls_config.h, line 4179:

 

* at the same pace. The typical accuracy of an RTC crystal is ±100 to ±20 parts

 

Is this intended?

 

Thanks

Stephan