惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

Simon Willison's Weblog
Simon Willison's Weblog
Help Net Security
Help Net Security
P
Privacy International News Feed
T
Threat Research - Cisco Blogs
C
Cisco Blogs
C
CERT Recently Published Vulnerability Notes
NISL@THU
NISL@THU
L
LINUX DO - 热门话题
Security Latest
Security Latest
A
Arctic Wolf
G
GRAHAM CLULEY
月光博客
月光博客
S
Securelist
D
Docker
J
Java Code Geeks
T
Troy Hunt's Blog
T
Tenable Blog
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
SecWiki News
SecWiki News
S
Security @ Cisco Blogs
量子位
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
L
LINUX DO - 最新话题
Recent Commits to openclaw:main
Recent Commits to openclaw:main
aimingoo的专栏
aimingoo的专栏
博客园 - 【当耐特】
H
Heimdal Security Blog
The Hacker News
The Hacker News
博客园 - 三生石上(FineUI控件)
Application and Cybersecurity Blog
Application and Cybersecurity Blog
N
Netflix TechBlog - Medium
Vercel News
Vercel News
Forbes - Security
Forbes - Security
B
Blog RSS Feed
H
Hackread – Cybersecurity News, Data Breaches, AI and More
IT之家
IT之家
B
Blog
MongoDB | Blog
MongoDB | Blog
博客园 - 聂微东
Google DeepMind News
Google DeepMind News
S
Secure Thoughts
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
C
Check Point Blog
云风的 BLOG
云风的 BLOG
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
T
The Blog of Author Tim Ferriss
L
Lohrmann on Cybersecurity
F
Full Disclosure
D
Darknet – Hacking Tools, Hacker News & Cyber Security
P
Proofpoint News Feed

程序萌部落

Problems in explanations of projections Cut off the tail [redundant parts] Describe one thing in 10 minutes 为什么开启这个里世界? Important rules in English writing Some writing pitfalls Summary of 2020 - Part B 2020年终总结(A)随便写写 Understand complex tables in one minute 我眼中的《姜子牙》——三界官场现形记 东北亚离战争到底有多远?
Try to explain the projections.
程序萌部落 · 2021-01-25 · via 程序萌部落

Multi-dimensional data is a very important type of data since it covers almost every kind of dataset. Most of them could be formatted into regular tables or datasheets, where the columns are called their dimensionalities. Researchers already have a lot of experience dealing with these datasets, but in the age of big data, the expansion of the number of dimensionalities (columns) is already made it big trouble for people to handle them.

Researchers have created a huge amount of methods to reduce the dimensionality to let people could find insight information in an understandable way. The basic idea is to compute the importance of different dimensions and preserve the principle dimensionalities (PCA). If dimensionality reduced to 2 or 3, then we can understand them, because it’s easy to show them reduce data with 2D or 3D space. But the problem is the losing details could play a big role in the process of our understanding. How to find these missing messages or how to evaluate the result of dimensionality reduction has become a key problem. In other words, giving good explanations to these results is the final aim that we want to reach.

<< Read more articles in https://www.cxmoe.com >>

With extra interactions, some methods could let people have global explanations by adding axes to the 2/3D space. But they can’t give much more local explanations, meanwhile, the existing methods that have local explanations are all heavily dependent on interactions.
Thus, we introduced the image-based method to compute the local explanations. It allows users to get local explanations and could complete the computation in one time without extra interaction. We designed three models to compute the local information and finally created six methods to test. We used more than 20 datasets to test every method and gave detailed illustrations to these explanations we get. Using the intrinsic structure we have known, we verified the rationality of our explanations.

Now, this research is still moving on, the future work will focus on the fluence of different parameters, and the evaluation methods for our explanations. With multiple good explanations we computed, we aim to give users more extra understanding so that guide them to choose the best dimensionality reduction method to handle their dataset.