(grep) Regex to match non-ASCII characters?

Original

2021-10-19 21:19

Question:

On Linux, I have a directory with lots of files. Some of them have non-ASCII characters in their names, but they are all valid UTF-8. One program has a bug that prevents it from working with non-ASCII filenames, and I have to find out how many files are affected. I was going to do this with find, then use grep to print the names containing non-ASCII characters, and then use wc -l to count them. It doesn't have to be grep; I can use any standard Unix regular expression tool, such as Perl, sed, AWK, etc.

However, is there a regular expression for 'any character that's not an ASCII character'?

Solution:

Reference 1: https://en.stackoom.com/question/8uYE
Reference 2: https://stackoom.com/question/8uYE
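
The linked answers are not reproduced here. As a minimal sketch of the idea only, and not necessarily what those answers recommend: ASCII covers code points 0x00 through 0x7F, so "any character that's not ASCII" can be written as the negated range [^\x00-\x7F]. The Python below applies that pattern to file names; the helper names count_non_ascii_names and count_in_tree are hypothetical and chosen purely for illustration.

```python
import os
import re

# Matches any single character outside the 7-bit ASCII range (U+0000 to U+007F).
NON_ASCII = re.compile(r"[^\x00-\x7F]")


def count_non_ascii_names(names):
    """Return how many of the given names contain at least one non-ASCII character."""
    return sum(1 for name in names if NON_ASCII.search(name))


def count_in_tree(root):
    """Count files under root (recursively) whose base name contains non-ASCII characters."""
    total = 0
    for _dirpath, _dirnames, filenames in os.walk(root):
        total += count_non_ascii_names(filenames)
    return total
```

Calling count_in_tree(".") would give the number of affected files under the current directory. Where grep supports Perl-compatible patterns (the -P option in GNU grep), the same character class can be used in the pipeline described in the question, for example find . | grep -P '[^\x00-\x7F]' | wc -l, with the caveat that this matches against the whole path, so non-ASCII directory names are counted too.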

