Skip to content
Projects
Groups
Snippets
Help
This project
Loading...
Sign in / Register
Toggle navigation
P
project2
Overview
Overview
Details
Activity
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
20200203098
project2
Commits
e7c142b0
Commit
e7c142b0
authored
Jun 29, 2020
by
Yuan
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
Add new file
parent
28911e57
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
23 additions
and
0 deletions
+23
-0
Project2/code/review
+23
-0
No files found.
Project2/code/review
0 → 100644
View file @
e7c142b0
Summary:
整体完成了,还有需要改进的地方
Detailed Comments:
1、有些包需要先导入,比如pandas
2、在train_data这部分,这行代码有点问题,company_main_name 是一个list,需要company_main_name = ''.join(lst)处理一下 if company_main_name not in ner_dict_new:
3、这里不建议加.toarray() ,因为加了就特别占用内存,如果数据量大的话,很容易就爆内存了
X_tfidf =TfidfVectorizer().fit_transform(corpus).toarray()
4、评估指标可以设置多个指标,方便具体查看
5、C的设置一般也不会设置这么大,C越小正则化越强
Overall Score: 85
--------------------------------------------------------------------------------------------------------------------------
Thanks for your efforts.
-Your instructor
\ No newline at end of file
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment