Fixes in unicode helper with convert to/from UTF8 #3247
josesimoes merged 1 commit into nanoframework:main from
- Issue with null terminator buffer size.
- Invalid surrogate pair handling where the second character is put back for reprocessing.
- Proper advancement of input pointers for invalid sequences.
- Edge cases with partial multi-byte sequences.
- Overlong encoding detection.
- Boundary conditions and valid/invalid transitions.
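To make the first two items concrete, here is a minimal sketch in C of a UTF-16 to UTF-8 conversion that reserves room for the null terminator and leaves the unit after an unpaired high surrogate in the input so it is processed again on the next iteration. The names `utf16_to_utf8` and `encode_utf8` are hypothetical and the error policy is an assumption; this is not the nanoFramework helper itself.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: encode one code point as UTF-8, return bytes written (1..4). */
static size_t encode_utf8(uint32_t cp, uint8_t *out)
{
    if (cp < 0x80) {
        out[0] = (uint8_t)cp;
        return 1;
    }
    if (cp < 0x800) {
        out[0] = (uint8_t)(0xC0 | (cp >> 6));
        out[1] = (uint8_t)(0x80 | (cp & 0x3F));
        return 2;
    }
    if (cp < 0x10000) {
        out[0] = (uint8_t)(0xE0 | (cp >> 12));
        out[1] = (uint8_t)(0x80 | ((cp >> 6) & 0x3F));
        out[2] = (uint8_t)(0x80 | (cp & 0x3F));
        return 3;
    }
    out[0] = (uint8_t)(0xF0 | (cp >> 18));
    out[1] = (uint8_t)(0x80 | ((cp >> 12) & 0x3F));
    out[2] = (uint8_t)(0x80 | ((cp >> 6) & 0x3F));
    out[3] = (uint8_t)(0x80 | (cp & 0x3F));
    return 4;
}

/* Hypothetical converter: turns len UTF-16 units into null-terminated UTF-8.
 * cap is the size of out in bytes, terminator included.
 * Returns the bytes written (terminator excluded), or (size_t)-1 if out is too small. */
size_t utf16_to_utf8(const uint16_t *in, size_t len, uint8_t *out, size_t cap)
{
    size_t i = 0, o = 0;

    while (i < len) {
        uint32_t cp = in[i++];

        if (cp >= 0xD800 && cp <= 0xDBFF) {
            if (i < len && in[i] >= 0xDC00 && in[i] <= 0xDFFF) {
                /* Valid pair: combine both units and consume the low surrogate. */
                cp = 0x10000 + ((cp - 0xD800) << 10) + (uint32_t)(in[i] - 0xDC00);
                i++;
            } else {
                /* Unpaired high surrogate: emit U+FFFD and do NOT consume in[i],
                 * so the following unit is reprocessed on the next iteration. */
                cp = 0xFFFD;
            }
        } else if (cp >= 0xDC00 && cp <= 0xDFFF) {
            cp = 0xFFFD; /* low surrogate with no preceding high surrogate */
        }

        uint8_t tmp[4];
        size_t n = encode_utf8(cp, tmp);
        if (o + n + 1 > cap) {
            return (size_t)-1; /* the +1 keeps one byte free for the terminator */
        }
        for (size_t k = 0; k < n; k++) {
            out[o++] = tmp[k];
        }
    }

    if (cap == 0) {
        return (size_t)-1; /* no room even for the terminator */
    }
    out[o] = 0;
    return o;
}
```

With this policy, the input `D83D 0041` yields U+FFFD followed by `A`, since the second unit is not swallowed along with the broken surrogate.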
Walkthrough

Modified Unicode conversion functions in the CLR runtime helper to improve handling of invalid UTF-8 and UTF-16 sequences. Changes include more aggressive input pointer advancement on errors, explicit validation of continuation bytes and surrogates, substitution of invalid sequences with U+FFFD replacement characters, and boundary condition refinements in finalization logic.
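The UTF-8 side of the walkthrough (continuation-byte validation, overlong detection, U+FFFD substitution, and always advancing the input on errors) can be sketched as a single-code-point decoder in C. The function name `utf8_decode_one` is hypothetical, and the choice of how many bytes to skip per invalid sequence is an assumption that may differ from what the merged change does.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical decoder: reads one code point from s[0..len).
 * Stores the result in *cp (U+FFFD for any invalid sequence) and returns the
 * number of bytes consumed. The return value is at least 1 whenever len > 0,
 * so a caller looping over the input always makes progress. */
size_t utf8_decode_one(const uint8_t *s, size_t len, uint32_t *cp)
{
    /* Smallest code point that legitimately needs a sequence of each length. */
    static const uint32_t min_cp[] = {0, 0, 0x80, 0x800, 0x10000};
    size_t need, i;
    uint32_t v;
    uint8_t b;

    *cp = 0xFFFD;
    if (len == 0) {
        return 0;
    }

    b = s[0];
    if (b < 0x80) {
        *cp = b;
        return 1;
    } else if ((b & 0xE0) == 0xC0) {
        need = 2;
        v = b & 0x1F;
    } else if ((b & 0xF0) == 0xE0) {
        need = 3;
        v = b & 0x0F;
    } else if ((b & 0xF8) == 0xF0) {
        need = 4;
        v = b & 0x07;
    } else {
        return 1; /* stray continuation byte or invalid lead byte */
    }

    for (i = 1; i < need; i++) {
        if (i >= len || (s[i] & 0xC0) != 0x80) {
            /* Truncated sequence or bad continuation byte: consume only the
             * bytes validated so far and leave s[i] for the next call. */
            return i;
        }
        v = (v << 6) | (uint32_t)(s[i] & 0x3F);
    }

    if (v < min_cp[need] || v > 0x10FFFF || (v >= 0xD800 && v <= 0xDFFF)) {
        return need; /* overlong, out of range, or an encoded surrogate */
    }

    *cp = v;
    return need;
}
```

Under this policy the overlong sequence `C0 80` decodes to a single U+FFFD and consumes both bytes, while `E2 41` consumes only the lead byte so that `A` is decoded normally on the next call.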
Estimated code review effort: 4 (Complex), ~45 minutes
Pre-merge checks: 1 failed (warning), 2 passed