python怎么对文本进行词频统计

2025-02-13 2410

核心提示：使用Python对文本进行词频统计可以使用下面的步骤：打开文本文件并读取文本内容。with open(text.txt, r) as file:text = file.r

使用Python对文本进行词频统计可以使用下面的步骤：

打开文本文件并读取文本内容。

with open("text.txt", "r") as file:    text = file.read()

对文本进行分词。

import re# 去除标点符号和空白字符text = re.sub(r'[^\w\s]', '', text)# 将文本拆分为单词列表words = text.split()

统计每个单词的出现次数。

from collections import Counterword_count = Counter(words)

排序并输出词频结果。

for word, count in word_count.most_common():    print(word, count)

完整的代码如下：

import refrom collections import Counterwith open("text.txt", "r") as file:    text = file.read()text = re.sub(r'[^\w\s]', '', text)words = text.split()word_count = Counter(words)for word, count in word_count.most_common():    print(word, count)

请确保将代码中的"text.txt"替换为实际的文本文件路径。

点赞 0举报打赏 0评论 0

更多>同类维修知识

推荐图文

vb组合框下拉内容怎么

推荐维修知识

点击排行

• matlab如何求二阶导数	• mysql怎么防止sql注入
• java防止sql注入的方式有哪些	• 电脑屏幕上出现无信号的原因有哪些
• 电脑屏幕黑屏但主机正常如何解决	• 电脑显示ip冲突如何解决
• Windows如何看IP是否冲突	• 怎么从hbase读取数据导入mongodb
• mongodb分片集群生产环境怎么配置	• php防止sql注入的方法有哪些