Skip to content
Projects
Groups
Snippets
Help
This project
Loading...
Sign in / Register
Toggle navigation
P
project2
Overview
Overview
Details
Activity
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
20200203098
project2
Commits
e7c142b0
Commit
e7c142b0
authored
4 years ago
by
Yuan
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
Add new file
parent
28911e57
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
23 additions
and
0 deletions
+23
-0
Project2/code/review
+23
-0
No files found.
Project2/code/review
0 → 100644
View file @
e7c142b0
Summary:
整体完成了,还有需要改进的地方
Detailed Comments:
1、有些包需要先导入,比如pandas
2、在train_data这部分,这行代码有点问题,company_main_name 是一个list,需要company_main_name = ''.join(lst)处理一下 if company_main_name not in ner_dict_new:
3、这里不建议加.toarray() ,因为加了就特别占用内存,如果数据量大的话,很容易就爆内存了
X_tfidf =TfidfVectorizer().fit_transform(corpus).toarray()
4、评估指标可以设置多个指标,方便具体查看
5、C的设置一般也不会设置这么大,C越小正则化越强
Overall Score: 85
--------------------------------------------------------------------------------------------------------------------------
Thanks for your efforts.
-Your instructor
\ No newline at end of file
This diff is collapsed.
Click to expand it.
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment