


























Hi HN,
This project distills a model's word embeddings into human-interpretable "concept-vectors", i.e. vectors in which each component tracks concerns like semantics, syntax, and even statistics potentially, while associating each component with a human readable and human definable label.
If you're in a rush, glance at the core concepts[1]. Then take a look at the scratch notebook[2] to get a quick idea of where I'm trying to go. If you find these intriguing, then go through the readme and supporting docs.
I want to acknowledge explicitly that this is a data design project. I have quite a bit of experience with data transformation and manipulation, but limited experience with NNs. I have not tested this on models, and I currently don't have the resources to build a comprehensive database to test it on models. I'm posting primarily for human feedback/criticism, and simply to share the idea since this is as far as I can currently take it.
[1] https://github.com/truehumanexe/concept_vector/blob/main/doc... [2] https://github.com/truehumanexe/concept_vector/blob/main/not...
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。