Golang 的 Naive Bayesian 类别 bayesian

xuanbao • • 1189 次点击

这是一个分享于的项目，其中的信息可能已经有所发展或是发生改变。

支持 TF-IDF 的 Naive Bayesian 分类。特点： * 条件概率和“对数似然”分值。 * 下溢检测。 * 分类器简单持久。 * 统计。 ### 举例1 (plain no tf-idf) 使用分类器，先创建一个分类并测试： <pre box-sizing:="" font-family:="" liberation="" font-size:="" margin-top:="" margin-bottom:="" font-stretch:="" line-height:="" word-wrap:="" padding:="" overflow:="" background-color:="" border-radius:="" word-break:="">import . "bayesian"const ( Good Class = "Good" Bad Class = "Bad")classifier := NewClassifier(Good, Bad)goodStuff := []string{"tall", "rich", "handsome"}badStuff := []string{"poor", "smelly", "ugly"} classifier.Learn(goodStuff, Good) classifier.Learn(badStuff, Bad)</pre> 然后你可以查明每个类的分值类的数据所属 : <pre box-sizing:="" font-family:="" liberation="" font-size:="" margin-top:="" margin-bottom:="" font-stretch:="" line-height:="" word-wrap:="" padding:="" overflow:="" background-color:="" border-radius:="" word-break:="">scores, likely, _ := classifier.LogScores( []string{"tall", "girl"} )</pre> 分数的大小表示似然性。另外（但浮溢的一些风险），但可以得到实际的概率 : <pre box-sizing:="" font-family:="" liberation="" font-size:="" margin-top:="" margin-bottom:="" font-stretch:="" line-height:="" word-wrap:="" padding:="" overflow:="" background-color:="" border-radius:="" word-break:="">probs, likely, _ := classifier.ProbScores( []string{"tall", "girl"} )</pre> ### [ ](https://github.com/jbrukh/bayesian#example-2-tf-idf)举例2 (TF-IDF) 在分类方法（LogScore， ProbSafeScore ， ProbScore ）之前，要使用 TF-IDF分类，首先必须创建一些类和测试它，之后你需要调用ConvertTermsFreqToTfIdf() 。 <pre box-sizing:="" font-family:="" liberation="" font-size:="" margin-top:="" margin-bottom:="" font-stretch:="" line-height:="" word-wrap:="" padding:="" overflow:="" background-color:="" border-radius:="" word-break:="">import . "bayesian"const ( Good Class = "Good" Bad Class = "Bad")classifier := NewClassiferTfIdf(Good, Bad) // Extra constructorgoodStuff := []string{"tall", "rich", "handsome"}badStuff := []string{"poor", "smelly", "ugly"} classifier.Learn(goodStuff, Good) classifier.Learn(badStuff, Bad) classifier.ConvertTermsFreqToTfIdf() // IMPORTANT !!</pre> 然后你可以查明每个类的分值和类的数据所属 : <pre box-sizing:="" font-family:="" liberation="" font-size:="" margin-top:="" margin-bottom:="" font-stretch:="" line-height:="" word-wrap:="" padding:="" overflow:="" background-color:="" border-radius:="" word-break:="">scores, likely, _ := classifier.LogScores( []string{"tall", "girl"} )</pre> 分数的大小表示似然性。另外（但浮溢的一些风险），但可以得到实际的概率 : <pre box-sizing:="" font-family:="" liberation="" font-size:="" margin-top:="" margin-bottom:="" font-stretch:="" line-height:="" word-wrap:="" padding:="" overflow:="" background-color:="" border-radius:="" word-break:="">probs, likely, _ := classifier.ProbScores( []string{"tall", "girl"} )</pre>

授权协议：: BSD
开发语言：: Google Go 查看源码»
操作系统：: 跨平台

1189 次点击

加入收藏微博

分类器

测试

下溢

github

0 回复

添加一条新回复（您需要登录后才能回复没有账号？）

请尽量让自己的回复能够对别人有帮助
支持 Markdown 格式, **粗体**、~~删除线~~、`单行代码`
支持 @ 本站用户；支持表情（输入 : 提示），见 Emoji cheat sheet
图片支持拖拽、截图粘贴等方式上传

Golang 的 Naive Bayesian 类别 bayesian

用户登录

今日阅读排行

一周阅读排行