Problem description
I'm trying to normalize a string in ColdFusion.
I want to use the Java class java.text.Normalizer for this, as CF doesn't have any similar functions as far as I know.
This is my current code:
<cfset normalizer = createObject( "java", "java.text.Normalizer" ) />
<cfset string = "éè" />
<cfset string = normalizer.normalize(string, createObject( "java", "java.text.Normalizer$Form" ).NFD) />
<cfset string = ReReplace(string, "\\p{InCombiningDiacriticalMarks}+", "") />
<cfoutput>#string#</cfoutput>
Any ideas why it always outputs éè and not a normalized string?
Recommended answer
In ColdFusion, unlike in Java, you don't need to escape backslashes in string literals. Your current regex therefore only matches text that starts with a literal backslash, so no replacement happens.
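For comparison, here is a minimal Java sketch of the same normalize-then-strip step. The class and method names (StripAccents, strip) are made up for illustration and are not from the question. In Java source the backslash has to be doubled inside the string literal, which is probably where the extra backslash came from; the equivalent CFML literal takes a single backslash.

```java
import java.text.Normalizer;

public class StripAccents {
    // Decompose to NFD, then delete the combining marks left behind.
    static String strip(String input) {
        String decomposed = Normalizer.normalize(input, Normalizer.Form.NFD);
        // "\\p" in Java source is the two-character regex token \p
        return decomposed.replaceAll("\\p{InCombiningDiacriticalMarks}+", "");
    }

    public static void main(String[] args) {
        // \u00e9 and \u00e8 are the precomposed forms of é and è
        System.out.println(strip("\u00e9\u00e8"));
    }
}
```

Unicode escapes are used for the input so the result does not depend on the encoding of the source file.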
Other than that, your code is perfectly correct and you can see that the length of the string is 8, not 4, at the time of the output. This is an effect of the normalize call.
However, remember that it is still an equivalent representation of the original string, and so it is not surprising that you cannot tell the difference visually. This is correct Unicode rendering in action.
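The length change and the equivalence can both be checked directly in Java. The sketch below assumes a four-character precomposed input (éèéè), chosen only so that the numbers line up with the lengths 4 and 8 quoted above; the class and method names are hypothetical.

```java
import java.text.Normalizer;

public class NfdLength {
    // NFD splits each accented letter into base letter + combining mark.
    static String decompose(String input) {
        return Normalizer.normalize(input, Normalizer.Form.NFD);
    }

    // NFC puts the pairs back together into precomposed characters.
    static String recompose(String input) {
        return Normalizer.normalize(input, Normalizer.Form.NFC);
    }

    public static void main(String[] args) {
        String original = "\u00e9\u00e8\u00e9\u00e8"; // assumed input: éèéè
        String decomposed = decompose(original);

        // The two strings render identically but differ in length.
        System.out.println(original.length() + " -> " + decomposed.length());

        // Canonically equivalent: recomposing gives the original back.
        System.out.println(recompose(decomposed).equals(original));
    }
}
```

Comparing the length before and after normalize is a simple way to confirm the call did something even though the rendered text looks unchanged.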