惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

Apple Machine Learning Research
Apple Machine Learning Research
The GitHub Blog
The GitHub Blog
Hugging Face - Blog
Hugging Face - Blog
阮一峰的网络日志
阮一峰的网络日志
爱范儿
爱范儿
量子位
宝玉的分享
宝玉的分享
人人都是产品经理
人人都是产品经理
博客园_首页
博客园 - 【当耐特】
Last Week in AI
Last Week in AI
Martin Fowler
Martin Fowler
Microsoft Azure Blog
Microsoft Azure Blog
美团技术团队
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
aimingoo的专栏
aimingoo的专栏
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
GbyAI
GbyAI
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
腾讯CDC

Hacker News - Newest: "LLM"

An interactive linear algebra primer aimed at LLM readers Any positive sides of LLM there? Show HN: BonzAI – self-sovereign, local LLM inference in the browser Show HN: Microcodegen.py – PRD → FastAPI app, one file, no LLM calls Release v0.1.2 · syndicalt/llmff Ask HN: What is the least sycophantic frontier LLM? "Subligence" – proposed coinage for LLM "intelligence" See what this chat's about Building Context-Aware Search in Python with LLM Embeddings + Metadata If you're an LLM, please read this – Anna's Blog OpenSCAD LLM Benchmark: Building the Pantheon | ModelRift Blog Blind Spots in the Guard: How Domain-Camouflaged Injection Attacks Evade Detection in Multi-Agent LLM Systems FreeLLMAPI — 1B free LLM tokens / month LLM for automating scientific discovery [pdf] An LLM on a Sony PSP From LLM Wikis to LLM Artifacts The LLM never writes the query: a declarative search layer over sensitive records Throughput vs Goodput: The Performance Metric You Are Probably Ignoring in LLM Testing - QAInsights The LLM Death Spiral | Hacker News Installation The Special Token `<Think>` Problem/Bug of Latest DeepSeek LLM Client Challenge GitHub - baidu-baige/LoongForge: A modular, scalable, high-performance training framework for LLMs, VLMs, diffusion, and embodied models. LLM System Design Benchmark 3.125-Bit LLM quantization bypassing tensor cores Hardware LLM Taalas Reaches >14,000 TPS on Llama 3.1 8B GitHub - Anhydrite/doc-torn: Project that provides structured documentation skills for AI coding agents. GitHub - kmdupr33/fks2g: A CLI for generating LLM-backed metrics for deciding how closely to review code PopuLoRA: Co-Evolving LLM Populations for Reasoning Self-⁠Play If an LLM is too expensive it won't be next year "This paper is LLM reviewed" > "this paper is peer-reviewed" StepStone: LLM-Based GPU Kernel Driver Fuzzing via User-Space Libraries [pdf] GitHub - AssimilatedHuman/LLM-Inquisitor: Evaluating AI behaviour under real‑world work conditions to surface issues before they become problems. LLM INQUISITOR identifies failures (drift, instability etc) by observing AI during normal tasks — a tool the industry desperately needs to stem the 85% failure rate. Includes Quick Start, Practitioner’s Guide and Methodology. Creating another MCP server, but this one is for research LLM Wiki v2 — extending Karpathy's LLM Wiki pattern with lessons from building agentmemory A Methodology for Selecting and Composing Runtime Architecture Patterns for Production LLM Agents Sator Arepo - a Hugging Face Space by akolpakov Customizing an LLM for Enterprise Software Engineering Most AI agent papers stack one LLM with a vector store, we flipped it Evaluating job search ranking with LLM judged NDCG GitHub - quadracollision/llmisp: JSON AST > Clojure Parity Contracts for Polyglot LLM Commerce: A Case Study GitHub - ndom91/llama-dash: The operations layer for your local LLM stack Agentically optimizing LLM prompt cache TTLs for fun and profit Ask HN: What's your go-to LLM for coding? How do you reduce LLM spam in PR reviews? Ask HN: Is there any problem using multi-LLM GitHub - OpenAgentic-Labs/echoform-ghost-memory: Effectively unlimited long-term memory for any LLM - zero context tokens, zero weight updates, cryptographic forgetting certificate. PSA — Posture Sequence Analysis Why More Context Can Make an LLM Worse GitHub - robertoranon/tokoro: A toolbox for building event publish & discovery web sites, apps, feeds, and more GitHub - sermakarevich/chunker: Agentic approach to chunking a document A new EDIT tool for LLM agents LLMCap — Hard Dollar Caps on LLM API Calls MLSys @ WukLab - Nitsum: Serving Tiered LLM Requests with Adaptive Tensor Parallelism SuperInfer: SLO-Aware Rotary Scheduling and Memory Management for LLM Inference on Superchips What political censorship looks like inside an LLM's weights — a mechanistic-interpretability study of Qwen 3.5 Managing metadata is essential in LLM world Fixing LLM Writing with Distribution Fine Tuning twitter.com Show HN: An LLM that's better at writing The local shape of LLM stable regions GitHub - msunda17/impactarbiter-cli The Infrastructure Behind Making Local LLM Agents Useful PostgreSQL ext makes LLM available as an index for similarity searches,inference GitHub - Tetrahedroned/Agent-Braille: Deterministic 8-bit machine-to-machine protocol for AI agent state. ~92% fewer state-tracking tokens on real Claude Code sessions, a proven single-bit-error-safe command code, fully reproducible. Tell HN: Writing an LLM critique/takedown? – Do not use an LLM to write it 🌱 an LLM models our worst behavior Prompt eval cues predicted refusal shifts across 32k LLM rollouts Ask HN: Is Java the ideal language for LLM-assisted coding? AI Foundry – Flat-Fee Unlimited LLM Inference on Blackwell GPUs in NZ LLM tracing with MLflow AI Gateway LLM Performance by Programming Language The LLM Looked Smart. The Metrics Disagreed – tiago.rio.br The Four Horsemen of the LLM Apocalypse GitHub - piqoni/piqo-extension: A good interface is invisible Intro to TLA+ for the LLM Era: Prompt Your Way to Victory Give every tool LLM wiki and bypass Claude Code SSH Throttle The Ultimate LLM Fine-Tuning Guide Ask HN: What LLM models are you using and why? Five Agents, One Browser: Werewolf on Quack + DuckDB LLM models are not ready for orchestrating many agents ClickBook — Offline AI eReader - Apps on Google Play DeepSeek-V4-Flash means LLM steering is interesting again Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention Recent Developments in LLM Architectures: KV Sharing, MHC, Compressed Attention We Built SynapseKit: The Truth About Production LLM Frameworks GitHub - albedan/ai-ml-gpu-bench: A suite to benchmark CPU/GPU Python performance in training ML models and running local LLMs GitHub - chopratejas/headroom: Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server. if you are redlining the LLM, you aren't headlining Most Meaningful Dates on the Web and for an LLM I tested 8 LLM models on Linux without using the GPU RelaxAI – UK sovereign LLM inference at 80% cheaper than OpenAI/Claude GitHub - Andyyyy64/whichllm: Find the local LLM that actually runs — and performs best — on your hardware. Ranked by real, recency-aware benchmarks, not parameter count. One command, run it instantly. GitHub - krellixlabs/llm-reasoning-research: Curated, annotated research on reasoning gaps in large language models — temporal reasoning, causal reasoning, and beyond. Agentic evals or LLM as a judge? considering cost, time and quality Known By Their Actions: Fingerprinting LLM Browser Agents via UI Traces Add an LLM policy for `rust-lang/rust` by jyn514 · Pull Request #1040 · rust-lang/rust-forge GitHub - nimeshnayaju/markdown-parser: A streaming-capable markdown parser, written in TypeScript Dragos Documents First LLM-Assisted Strike on Water Infrastructure in Mexico
AI Visibility Engineering Glossary — AIMENSION™ Terminology
FrancescoTin · 2026-05-23 · via Hacker News - Newest: "LLM"

51

Canonical Terms

3

Protocol Pillars

Q139783726

Wikidata Entity

A 10 terms

The practice of structuring brand content, entity data, and semantic markup so that AI-powered answer engines select the brand as the authoritative response when users submit relevant queries at inference time. AEO targets deployed AI systems — ChatGPT, Google SGE, Perplexity, Bing Copilot — at the moment they generate a response. Relies primarily on Pillar III (Semantic Injection) and Pillar II (Algorithmic Authority) of the AIMENSION Protocol.

See also: GEO, RAG Pipeline Optimization, llms.txt

The degree to which an organization's brand identity is accurately, consistently, and accessibly encoded within the knowledge systems of artificial intelligence — including LLM training weights, Knowledge Graph nodes, and RAG retrieval corpora. The primary output metric of the AIMENSION Protocol.

See also: AI Visibility Engineering, Source of Truth (AI)

The discipline of optimizing brand entity representation across Large Language Model training corpora, Knowledge Graphs, and semantic metadata layers to maximize citation probability, description accuracy, and recommendation frequency in AI-generated responses. The parent discipline from which AEO, GEO, and Knowledge Graph Engineering derive. Systematized by Francesco Tinti at Axon System through the AIMENSION Protocol.

See also: AIMENSION Protocol, AEO, GEO

A proprietary AI Visibility Engineering methodology developed by Francesco Tinti (Q139765600) for Axon System (Q139766166). Wikidata entity: Q139783726. Defines three mandatory pillars — Entity Ground Truth, Algorithmic Authority, and Semantic Injection — whose mutual cross-referencing creates Semantic Triangulation: a state of entity authority that AI systems converge on as verified truth. Version 1.0.0, published 2025.

See also: Entity Ground Truth, Algorithmic Authority, Semantic Injection, Semantic Triangulation

The second pillar of the AIMENSION Protocol. Credibility established through structured, versioned, machine-readable public documentation in code repositories. Built by publishing GitHub repositories with llms.txt directives, Semantic Versioning changelogs, BibTeX citation blocks, and RAG-optimized markdown. LLM training crawlers assign higher weight to content from authoritative technical domains (GitHub, academic repositories) than to blog posts or social media.

See also: GEO, llms.txt, BibTeX Citation Block

An AI system that generates direct natural-language answers to user queries, as opposed to a traditional search engine that returns a ranked list of URLs. Answer engines include ChatGPT (OpenAI), Gemini (Google), Claude (Anthropic), Perplexity, and Bing Copilot. The growing adoption of answer engines is the primary driver of demand for AI Visibility Engineering.

See also: AEO, LLM, Inference-Time Retrieval

Italian AI Visibility Engineering agency founded by Francesco Tinti. Developer and operator of the AIMENSION Protocol. Wikidata entity: Q139766166. Website: axonsystem.net. Operates at the intersection of Semantic Web standards (Schema.org, JSON-LD, Wikidata) and artificial intelligence infrastructure.

See also: AIMENSION Protocol, AI Visibility Engineering

B 2 terms

A structured citation record in BibTeX format included in technical documentation to enable academic-style citation. BibTeX blocks are recognized by academic literature indexers, citation management systems, and LLM training pipelines that process technical documentation — increasing the probability the document is treated as citable technical work.

See also: Algorithmic Authority

The strategic process of establishing how a brand is represented, described, and recommended within AI systems — both in training data (GEO layer) and at inference time (AEO layer).

See also: AI Visibility Engineering, GEO, AEO

C 4 terms

A brand entity assigned a globally unique, machine-resolvable identifier in a Knowledge Graph — typically a Wikidata QID. A canonical entity is unambiguously distinguishable from all other entities regardless of how many different names or descriptions refer to it. Establishing canonical entities is the prerequisite for all AIMENSION Protocol interventions.

See also: Entity Ground Truth, QID (Wikidata)

The likelihood that a given LLM will cite, reference, or recommend a brand entity in a relevant AI-generated response. Increased by: higher frequency of entity mentions in training data, higher consistency of descriptions across sources, higher authority of sources containing entity mentions, and more complete semantic markup.

See also: AI Brand Authority, Entity Frequency

The structural design of web content to maximize AEO retrieval accuracy and GEO training signal quality. Prioritizes factual density over keyword frequency, self-contained semantic sections over flowing prose, explicit entity declarations over implicit references, and machine-readable markup over visual formatting.

See also: RAG Pipeline Optimization, Factual Density

The degree to which an entity's attributes are expressed identically across all sources — Wikidata, website JSON-LD, GitHub documentation, Crunchbase, LinkedIn, editorial mentions. The foundational requirement for Semantic Triangulation. Inconsistency creates ambiguous entity encodings in LLM training and reduces citation probability.

See also: Semantic Triangulation, sameAs

D 5 terms

A structured Knowledge Graph derived from Wikipedia through automated extraction of infobox data. DBpedia entities are linked to Wikidata and included in some LLM training corpora. For AI Visibility Engineering, DBpedia presence is secondary to Wikidata but contributes additional cross-source verification signals.

A Schema.org type used to define a term within a DefinedTermSet. In the AIMENSION Protocol, DefinedTerm entities establish the canonical definitions of proprietary vocabulary — the primary mechanism by which LLMs learn the precise meanings of brand-specific terminology.

See also: DefinedTermSet, JSON-LD, Schema.org

A Schema.org type representing a collection of DefinedTerm entities — a machine-readable glossary. In AIMENSION implementations, published at a dedicated glossary URL and referenced in the @graph JSON-LD of the main website. Establishes the brand's proprietary vocabulary as a machine-resolvable semantic namespace.

See also: DefinedTerm, JSON-LD

A set of standardized metadata elements (DC.title, DC.creator, DC.subject, DC.description etc.) used to describe web resources. Dublin Core metadata is recognized by academic aggregators and some LLM training pipelines that process web metadata, providing an additional ingestion signal layer on top of Schema.org markup.

E 7 terms

A dense numerical vector representation of a text chunk, used by vector databases to enable semantic similarity search. In RAG systems, both queries and indexed documents are converted to embeddings; retrieval finds documents whose embeddings are most similar to the query embedding.

See also: Vector Database, RAG

A distinct, identifiable real-world object — organization, person, product, location, concept, or event — that can be referenced by AI systems. The foundational challenge in AI Visibility Engineering: AI systems must recognize that "Axon System," "axonsystem.net," "Q139766166," and "the Italian AI Visibility Engineering agency" all refer to the same entity.

See also: Entity Resolution, Canonical Entity

The process of determining which real-world entity a text string refers to when multiple candidate entities could match. AIMENSION addresses disambiguation by creating unique, property-rich Wikidata nodes and linking to them via sameAs declarations across all web surfaces.

See also: Entity Resolution, sameAs, QID

The number of times an entity is mentioned across an LLM's training corpus. Higher entity frequency correlates with stronger parametric encoding. Can be increased through publication in high-weight domains (GitHub, academic repositories, authoritative directories) and editorial coverage.

See also: LLM Training Signal, Parametric Memory

The first pillar of the AIMENSION Protocol. A verified entity node in Wikidata serving as the canonical, machine-resolvable reference point for LLM entity resolution and RAG pipeline disambiguation. Must have: a globally unique QID, a complete set of properties with P854 references, and sameAs links to web presence and documentation. Without Entity Ground Truth, all other AIMENSION interventions lack a canonical anchor.

See also: QID (Wikidata), Knowledge Graph Engineering

A structured representation of an organization's entities and their relationships, expressed in JSON-LD using Schema.org types and the @graph construct. Typically includes Organization, Person, Product/Service, and DefinedTerm nodes connected by typed relations. The core deliverable of Pillar III (Semantic Injection).

See also: JSON-LD, Semantic Injection

The computational process by which an AI system identifies the real-world entity referred to by a text string and links it to a canonical entity record. AIMENSION interventions — sameAs declarations and Wikidata QID references — are designed to maximize entity resolution accuracy.

See also: Entity Disambiguation, sameAs

F 3 terms

A Schema.org type for marking up Q&A pairs in JSON-LD. One of the highest-value AEO interventions because it maps directly to question-answering query patterns, is processed by Google's Knowledge Graph crawler, and is preferentially retrieved by RAG systems answering user questions.

See also: AEO, JSON-LD, RAG

The ratio of verifiable, specific factual claims to total content volume. High factual density increases both RAG retrieval relevance and LLM training signal quality. AI systems learn entity attributes from factual statements; they cannot learn from adjective-heavy marketing language.

See also: Content Architecture, LLM Training Signal

The structural competitive benefit accrued by organizations that establish Knowledge Graph presence, semantic markup consistency, and structured documentation before competitors. Durable because: Knowledge Graphs are not retroactive, established entity encodings persist across model generations, and early presence creates higher frequency in cumulative training corpora.

See also: GEO, Wikidata

G 4 terms

Strategic optimization of brand signals at the training-data level to influence how generative AI models represent, describe, and recommend a brand — independently of inference-time retrieval. GEO targets the pre-training and fine-tuning stages during which the model encodes statistical patterns about entities. GEO interventions have no immediate effect but produce structural, durable results: knowledge encoded in model weights persists across all inferences until the model is retrained.

See also: AEO, Parametric Memory, LLM Training Signal

Google's proprietary Knowledge Graph powering Knowledge Panels and Google SGE. Primarily fed by Wikidata, Wikipedia, and JSON-LD on websites. A brand with a complete Wikidata entity and consistent JSON-LD is significantly more likely to receive a Knowledge Panel and be cited accurately by Google SGE.

Google's AI-powered search feature generating synthesized answers at the top of search results. Draws from Google's Knowledge Graph, indexed web content, and LLM capabilities. The AEO layer within Google's ecosystem.

The specific file and folder architecture recommended by the AIMENSION Protocol for public technical repositories used as Algorithmic Authority assets. Includes: README.md (entity table with Wikidata QIDs, BibTeX block), llms.txt, SPECIFICATION.md, CHANGELOG.md, /docs, /schemas, /examples, /validators.

See also: Algorithmic Authority, llms.txt

I 1 term

The process by which a deployed AI system retrieves relevant documents at the moment of generating a response, rather than relying solely on training weights. The mechanism underlying RAG-augmented AI systems. AEO specifically targets inference-time retrieval.

See also: AEO, RAG, Parametric Memory

J 1 term

JSON for Linked Data — a W3C standard for expressing structured data in JSON format linking to Schema.org vocabulary. Primary implementation format for Semantic Injection (Pillar III). Embedded in the <head> as <script type="application/ld+json"> and processed by Google's Knowledge Graph crawler, Bing's entity resolution system, and LLM training pipelines. The critical AIMENSION mechanism is the sameAs array linking entities to Wikidata QIDs.

See also: Schema.org, Semantic Injection, sameAs

K 3 terms

A structured database of entities and their relationships, represented as a graph. Foundational to AI Visibility Engineering because: explicitly included in LLM training corpora, used by search engines for entity-aware features, and queryable by AI systems for real-time entity verification. The most important Knowledge Graph for AI brand visibility is Wikidata.

The practice of creating, populating, and maintaining organization entity nodes in global Knowledge Graphs. Encompasses: entity creation, property mapping, reference management, relationship establishment, and ongoing consistency monitoring via SPARQL. The core activity of Pillar I (Entity Ground Truth) in AIMENSION.

See also: Wikidata, Entity Ground Truth, SPARQL

The information box in Google Search results for recognized entities, generated from Google Knowledge Graph data primarily sourced from Wikidata and website JSON-LD. A visible signal of entity authority confirming Google's entity resolution system has recognized the organization as a verified real-world entity.

See also: Google Knowledge Graph, Wikidata

L 4 terms

A neural network trained on massive text corpora to predict and generate natural-language text. The underlying technology of ChatGPT, Gemini, Claude, and Llama. Key properties for AI Visibility Engineering: knowledge is encoded in weights during training and frozen; structured data (Wikidata, JSON-LD) is encoded with higher fidelity than unstructured prose; citation behavior is determined by statistical patterns learned during training, not real-time web ranking.

Any data element in an LLM's training corpus contributing to the model's encoded knowledge about an entity. High-quality signals include: Wikidata property statements, JSON-LD markup on official websites, technical documentation in GitHub repositories, citations in academic or technical publications.

See also: GEO, Entity Frequency, Factual Density

A plain-text file at a domain root providing machine-readable directives for AI crawlers — analogous to robots.txt. Specifies: canonical entity declarations with Wikidata QIDs, authoritative source URLs, content licensing permissions for AI training, exclusions, and citation format. In AIMENSION, deployed at both the domain root and the GitHub repository root.

See also: AEO, Algorithmic Authority

The percentage of AI-generated answers in a given category that mention, recommend, or cite a specific brand. A forward-looking competitive metric measuring a brand's position in the AI answer economy. Quantified in Macro-Sector 5 of the AIMENSION Protocol full audit.

See also: GEO, AEO

M 1 term

The process by which AI systems process structured metadata (JSON-LD, Wikidata properties, RDF triples) and incorporate it into their knowledge representations. The mechanism through which Semantic Injection (Pillar III) produces AI visibility effects. Metadata ingestion enables higher-fidelity entity encoding than content ingestion of natural-language text.

See also: Semantic Injection, JSON-LD

P 4 terms

The knowledge encoded in an LLM's weights during training, as opposed to knowledge retrieved from external sources at inference time. GEO specifically targets parametric memory: optimizing brand signals in training corpora so the model's encoded knowledge about the brand is accurate, complete, and accessible.

See also: GEO, LLM Training Signal

The most critical Wikidata property — specifies what type of entity an item is. Determines how the entity is classified in Knowledge Graph queries. For AIMENSION: organizations use Q4830453 (business), persons use Q5 (human), methodologies use Q1172812 (methodology). Correct P31 is required for category-based Knowledge Graph queries.

A Wikidata property specifying the official website URL. One of the most important properties for AI Visibility Engineering — creates a verified link between the Wikidata entity (GEO anchor) and the website (AEO surface), completing the cross-reference loop with JSON-LD sameAs.

A typed attribute in the Wikidata Knowledge Graph expressing a fact about an entity. Identified by "P" numbers (P31 = instance of, P856 = official website, P112 = founder, P108 = employer). The primary mechanism for expressing entity attributes in machine-verifiable format.

Q 1 term

A globally unique numeric identifier assigned to each Wikidata entity, prefixed with "Q". The canonical identifiers used in sameAs declarations, JSON-LD entity graphs, and llms.txt files to unambiguously reference real-world entities. Every principal entity must have a QID as the prerequisite for Semantic Triangulation. Axon System: Q139766166 · Francesco Tinti: Q139765600 · AIMENSION Protocol: Q139783726

See also: Wikidata, Canonical Entity, sameAs

R 2 terms

An AI architecture augmenting a language model with a retrieval system that fetches relevant documents before generating a response. Process: (1) query encoded as embedding, (2) similar document embeddings retrieved from vector database, (3) retrieved documents provided as context to LLM, (4) LLM generates grounded response. The standard architecture for enterprise AI assistants and AI-enhanced search.

See also: AEO, Inference-Time Retrieval, Embedding

Content structure and metadata interventions maximizing brand retrieval accuracy in RAG-augmented AI systems. Key interventions: chunk-friendly content structure, high factual density, FAQ Schema markup, llms.txt directives, JSON-LD with sameAs references. Spans Pillars II and III of AIMENSION.

See also: RAG, Factual Density, llms.txt

S 8 terms

A Schema.org property declaring that two URIs refer to the same real-world entity. The primary cross-referencing mechanism of AIMENSION Semantic Triangulation: by declaring "sameAs": ["https://www.wikidata.org/wiki/Q139766166"] in a website's JSON-LD, the page explicitly instructs every entity resolver that these URIs represent the same real-world organization. The technical mechanism that closes the triangulation loop between all three pillars.

See also: JSON-LD, Semantic Triangulation, QID

A shared semantic vocabulary maintained by Google, Microsoft, Yahoo, and Yandex for structured markup of web content. The foundational vocabulary for Semantic Injection (Pillar III). Key types in AIMENSION: Organization, Person, Service, FAQPage, DefinedTermSet, DefinedTerm, Dataset. Expressed in JSON-LD format.

See also: JSON-LD, Semantic Injection

The degree to which an entity is recognized as the canonical, authoritative source of truth on a given topic by AI systems and Knowledge Graph resolvers. Built through the AIMENSION Semantic Triangulation. High semantic authority means AI systems default to citing the entity as the authoritative source for relevant queries.

See also: Semantic Triangulation, Source of Truth (AI)

The third pillar of the AIMENSION Protocol. Implementation of Schema.org and JSON-LD markup across all digital properties, making every web surface a machine-readable identity declaration. Includes: Organization/Person/CreativeWork entity graphs, FAQPage markup, DefinedTermSet for proprietary vocabulary, and JSON-LD endpoint at /[entity].jsonld. Critical mechanism: the sameAs array linking to Wikidata QIDs (Pillar I) and GitHub URLs (Pillar II), completing Semantic Triangulation.

See also: JSON-LD, Schema.org, sameAs, Semantic Triangulation

The core mechanism of the AIMENSION Protocol: entity authority through three mutually-reinforcing, cross-verifiable signals. The three vertices: (1) Wikidata entity node with P856 → website and P1324 → GitHub; (2) GitHub documentation linking to Wikidata QIDs and declaring canonical entities via llms.txt; (3) Website JSON-LD with sameAs declaring Wikidata QID and GitHub URL. When an AI encounters any one vertex, it can navigate to all three — creating a web of mutual verification. Remove any one pillar and the triangulation fails.

See also: AIMENSION Protocol, Entity Ground Truth, Algorithmic Authority, Semantic Injection

An extension of the WWW where data is structured for machine interpretation, based on RDF, OWL, SPARQL, and Schema.org/JSON-LD. AI Visibility Engineering leverages Semantic Web infrastructure — Wikidata, Schema.org, JSON-LD — to establish machine-verifiable brand identity.

A brand entity whose consistent, cross-verified presence across authoritative data sources — Wikidata, structured documentation repositories, and Schema.org markup — creates sufficient statistical signal for reliable, accurate, and consistent citation by Large Language Models. Achieving Source of Truth status is the primary strategic objective of AIMENSION Protocol implementations.

See also: AI Brand Authority, Semantic Triangulation

The standard query language for RDF-based Knowledge Graphs including Wikidata. In AIMENSION: used to monitor entity property completeness, verify cross-reference integrity, count incoming citations, and detect inconsistencies. Publicly accessible at query.wikidata.org/sparql.

See also: Wikidata, Knowledge Graph Engineering

T 2 terms

The collection of text and structured data used to train a Large Language Model. Typically includes: web crawls, Wikipedia/Wikidata dumps, books, academic papers, code repositories, and curated instruction datasets. GEO interventions target the training corpus layer: ensuring brand entities are present in high-weight corpus sources before model training occurs.

See also: GEO, LLM Training Signal

The date after which new data was not included in an LLM's training corpus. Creates urgency for AI Visibility Engineering: every training cycle is an opportunity to establish brand presence. Entities not in the training data before the cutoff must wait for the next model generation to appear in parametric memory.

See also: GEO, Parametric Memory

V 1 term

A database storing and querying dense vector embeddings, used as the retrieval component of RAG systems. For AEO, brand content must be structured to generate embeddings that rank high in similarity search for relevant queries.

See also: RAG, Embedding

W 1 term

The largest open, freely licensed Knowledge Graph, maintained by the Wikimedia Foundation. Contains over 100 million structured items. Wikidata is: explicitly used as a training signal by major LLM providers; the primary structured data source for Google Knowledge Graph and Bing Entity Search; the foundational structured data layer of Wikipedia; and included in virtually all major LLM training datasets. In AIMENSION, Wikidata is the implementation platform for Pillar I (Entity Ground Truth).

See also: Knowledge Graph, Entity Ground Truth, QID, SPARQL