编程问答

前端开发问题 Java开发问题 C/C++开发问题 Python开发问题 C#/.NET开发问题 php开发问题 移动开发问题 数据库问题

如何从 .NET 字符串中获取 Unicode 代码点数组?

2023-05-20C#/.NET开发问题

1

本文介绍了如何从 .NET 字符串中获取 Unicode 代码点数组?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着跟版网的小编来一起学习吧！

问题描述

限时送ChatGPT账号..

我有一个字符范围限制列表，我需要检查一个字符串，但 .NET 中的 char 类型是 UTF-16，因此某些字符会变成古怪的(代理)对.因此，当枚举 string 中的所有 char 时，我没有得到 32 位 Unicode 代码点，并且一些高值比较失败.

I have a list of character range restrictions that I need to check a string against, but the char type in .NET is UTF-16 and therefore some characters become wacky (surrogate) pairs instead. Thus when enumerating all the char's in a string, I don't get the 32-bit Unicode code points and some comparisons with high values fail.

我对 Unicode 有足够的了解，如有必要，我可以自己解析字节，但我正在寻找 C#/.NET Framework BCL 解决方案.所以...

I understand Unicode well enough that I could parse the bytes myself if necessary, but I'm looking for a C#/.NET Framework BCL solution. So ...

如何将 string 转换为 32 位 Unicode 代码点的数组 (int[])?

How would you convert a string to an array (int[]) of 32-bit Unicode code points?

推荐答案

这个答案不正确.请参阅@Virtlink 的正确答案.

static int[] ExtractScalars(string s)
{
  if (!s.IsNormalized())
  {
    s = s.Normalize();
  }

  List<int> chars = new List<int>((s.Length * 3) / 2);

  var ee = StringInfo.GetTextElementEnumerator(s);

  while (ee.MoveNext())
  {
    string e = ee.GetTextElement();
    chars.Add(char.ConvertToUtf32(e, 0));
  }

  return chars.ToArray();
}

注意事项:处理复合字符需要规范化.

Notes: Normalization is required to deal with composite characters.

这篇关于如何从 .NET 字符串中获取 Unicode 代码点数组?的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持跟版网！

The End

相关推荐

C# 中的多播委托奇怪行为?

C# 中的多播委托奇怪行为?

Multicast delegate weird behavior in C#?(C# 中的多播委托奇怪行为?)...

2023-11-11 C#/.NET开发问题

6

参数计数与调用不匹配?

参数计数与调用不匹配?

Parameter count mismatch with Invoke?(参数计数与调用不匹配?)...

2023-11-11 C#/.NET开发问题

26

如何将代表存储在列表中

如何将代表存储在列表中

How to store delegates in a List(如何将代表存储在列表中)...

2023-11-11 C#/.NET开发问题

6

代表如何工作(在后台)?

代表如何工作(在后台)?

How delegates work (in the background)?(代表如何工作(在后台)?)...

2023-11-11 C#/.NET开发问题

5

没有 EndInvoke 的 C# 异步调用?

没有 EndInvoke 的 C# 异步调用?

C# Asynchronous call without EndInvoke?(没有 EndInvoke 的 C# 异步调用?)...

2023-11-11 C#/.NET开发问题

2

Delegate.CreateDelegate() 和泛型:错误绑定到目标方法

Delegate.CreateDelegate() 和泛型:错误绑定到目标方法

Delegate.CreateDelegate() and generics: Error binding to target method(Delegate.CreateDelegate() 和泛型:错误绑定到目标方法)...

2023-11-11 C#/.NET开发问题

14

热门文章

1阅读完 JSON 内容后遇到的附加文本: 2Excel 错误 HRESULT: 0x800A03EC 尝试使用单元格名称获取范围 3承载错误 - invalid_token - 未找到签名密钥 4反序列化 Newtonsoft.Json 中的自定义异常 5RabbitMQ 连接错误没有一个指定的端点是可达的" 6“由于系统缺乏足够的缓冲区空间或队列已满，无法对套接字执行操作" 7使用 System.IdentityModel.Tokens.Jwt 解码和验证 JWT 令牌 8Linq - 在多个 (OR) 条件下进行左连接

热门精品源码

最新VIP资源

1多功能实用站长工具箱html功能模板 2多风格简历在线生成程序网页模板 3论文相似度查询系统源码 4响应式旅游景点宣传推广页面模板 5在线起名宣传推广网站源码 6酷黑微信小程序网站开发宣传页模板 7房产销售交易中介网站模板 8小学作业自动生成程序