Commit e7c142b0 by Yuan

Add new file

parent 28911e57
Summary:
The assignment is complete overall, but a few things still need improvement.
Detailed Comments:
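1. Some packages need to be imported before they are used, for example pandas.
2. In the train_data part, the line below has a problem: company_main_name is a list, so it needs to be converted first with company_main_name = ''.join(lst).
if company_main_name not in ner_dict_new:
3. Adding .toarray() here is not recommended. It converts the sparse matrix into a dense array, which uses a lot of memory, and with a large dataset it can easily run out of memory.
X_tfidf = TfidfVectorizer().fit_transform(corpus).toarray()
4. Set several evaluation metrics rather than one, so the results can be examined in more detail.
5. C is usually not set this large. The smaller C is, the stronger the regularization.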
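
The comments above could be applied roughly as in the sketch below. It is only an illustration under assumptions: the toy corpus, the labels, the contents of lst and ner_dict_new, the choice of LogisticRegression, and C=1.0 are all hypothetical, since the submitted model and data are not shown here. Only TfidfVectorizer, company_main_name, ner_dict_new, X_tfidf, and corpus come from the submission.

```python
# Comment 1: import everything up front, before it is used.
from scipy import sparse
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, f1_score, precision_score, recall_score

# Hypothetical toy data, for illustration only.
corpus = [
    "acme trading company limited",
    "acme logistics group",
    "weather is nice today",
    "see you at lunch tomorrow",
    "globex holdings company",
    "the meeting starts at noon",
]
labels = [1, 1, 0, 0, 1, 0]

# Comment 2: a list is unhashable, so join it into a string
# before testing it against the dict keys.
lst = ["acme", "trading"]          # hypothetical contents
ner_dict_new = {}                  # hypothetical, empty for the sketch
company_main_name = "".join(lst)
if company_main_name not in ner_dict_new:
    ner_dict_new[company_main_name] = 1

# Comment 3: keep the TF-IDF output sparse (no .toarray()).
X_tfidf = TfidfVectorizer().fit_transform(corpus)
print("sparse:", sparse.issparse(X_tfidf), "shape:", X_tfidf.shape)

# Comment 5: a moderate C (assumed model: LogisticRegression).
# Smaller C means stronger regularization.
clf = LogisticRegression(C=1.0, max_iter=1000)
clf.fit(X_tfidf, labels)

# Comment 4: report several metrics instead of one.
# Scored on the training data only to keep the sketch short;
# use a held-out split in a real evaluation.
pred = clf.predict(X_tfidf)
metrics = {
    "accuracy": accuracy_score(labels, pred),
    "precision": precision_score(labels, pred, zero_division=0),
    "recall": recall_score(labels, pred, zero_division=0),
    "f1": f1_score(labels, pred, zero_division=0),
}
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

scikit-learn estimators such as LogisticRegression accept the sparse matrix directly, so dropping .toarray() normally needs no other change to the training code.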
Overall Score: 85
--------------------------------------------------------------------------------------------------------------------------
Thanks for your efforts.
-Your instructor