object vs nested type in data mapping in Elasticsearch

jdhao's digital space

Conversion between base64 and OpenCV or PIL Image 腾讯云对象存储博客图床开启 CDN 加速(不需要购买额外域名) Search and Replace in Multiple Files in Vim/Neovim Change Table Column Width in LaTeX Image or Table Side by Side in LaTeX LaTeX 并排显示图像或表格 Firenvim: Neovim inside Your Browser Content inside HTML tags missing in Latest Hugo? Creating Markdown Front Matter with Ultisnips Labelme JSON 标注格式转 voc XML 格式 Nifty Nvim Techniques That Make My Life Easier -- Series 6 macOS 下如何为视频制作字幕 Running Command Asynchronously inside Neovim Resolving Merge Conflict after Git Stash Pop Pylint: command not found? A Hands-on Experience with Neovim's Built-in LSP Support How to Convert PDF to Images with Imagemagick 互联网上常用缩略语集锦 File Backup in Neovim Converting PDF Pages to Images with Poppler Nifty Nvim Techniques That Make My Life Easier -- Series 5 Neovim Configuration for System-wide Use How to sort a list of tuple or list in Python -- lambda or itemgetter? Building A Vim Statusline from Scratch 人类第一颗原子弹爆炸始末 Distributed Training in PyTorch with Horovod Learning Expect Programming Essential Knowledge about SSH Nifty LaTeX Techniques -- Series 1 更改 Adsense 邮寄地址，重新寄送 PIN Mintty Tips and Configurations Generating Table of Contents for Markdown with Tagbar Convert Python Script to Exe on Windows with Pyinstaller Ubuntu on Windows Missing after Windows Update 使用代理加速 Mac 终端下载速度 My Experience with Several Zsh Plugin Managers 深圳租房小记 How to Install zplug inside Docker Container Why don't settings inside bashrc or bash_profile take effect? Setting Up Locale in Linux 谷歌 Adsense 申请及在 Hugo 中的配置 How to Write Algorithm Pseudo Code in LaTeX Nifty Nvim Techniques That Make My Life Easier -- Series 4 A Few Grammar Questions in Writing How to Read and Write Images with Unicode Paths in OpenCV Creating A Professional Table in LaTeX with booktabs How to Create Proper Folding for Vim/Nvim Configuration Linux Tips and Tricks -- s1 JPEG Image Orientation and Exif How Do I Show the Current File Path In Neovim? JPEG Image Quality in PIL Difference between view, reshape, transpose and permute in PyTorch Convert PIL or OpenCV Image to Bytes without Saving to Disk Fast Movement and Navigation Inside Vim or Neovim Unintuitive Behaviour of Case Sensitivity in Python glob Binding Keys in Zsh 几把机械键盘试用体验 Nvim Autocompletion with Deoplete Converting Markdown to Beautiful PDF with Pandoc Exclusive and Inclusive Motion in Neovim/Vim Nifty Nvim Techniques Which Make My Life Easier -- Series 3 Why Doesn't Jedi Autocompletion Work for Some Methods Vim-like Editing inside Browser Markdown 生成 HTML 时汉字之间出现多余空格问题小米 9 安装谷歌商店（Google Play Store）与相关配置 Create Mappings That Take A Count in Neovim Spell Checking in Nvim English Words Completion inside Neovim/Vim How to Use Python Inside Vim Script with Neovim Nifty Little Nvim Techniques to Make My Life Easier -- Series 2 Setting up Ultisnips for Neovim Mac 上罗技 M590 鼠标设置 Nifty Little Nvim Techniques to Make My Life Easier -- Series 1 A Complete Guide on Writing LaTeX with Vimtex in Neovim Manipulating Images with Alpha Channels in Pillow Sublime Text Regular Expression Cheat Sheet Cropping Rotated Rectangles from Image with OpenCV Boosting Your Productivity on Terminal with Zsh and Plugins 最新版 Rime 输入法使用 (2022 更新) Display Image with Pillow inside Ubuntu on Windows Faster Directory Navigation with z.lua Cmder Advanced Configurations Nvim-qt Settings on Windows 10 Tmux Plugin Install and Management How to Debug Python Code in Terminal Markdown Writing and Previewing in Neovim -- A Complete Guide Line Number Settings for More Efficient Movement in Neovim 两个大规模中文语料库介绍以及处理 Windows 系统下几款程序员不可不用的神器我的 2018 阅读清单 A Complete Guide to Neovim Configuration for Python Development How Is Newline Handled in Python and Various Editors? Two Issues Related to ImageFont Module in PIL 在 Listary 中调用 GoldenDict 或欧路词典查单词 Reading and Writing Text Files on Windows The Mathematics behind Font Shapes --- Bézier Curves and More 快速识别图片字体：字体识别工具介绍 Deoplete Failed to Load at Startup after Updating Python neovim Package What Is The Difference between pip, pip3 and pip3.6 Shipped with Anaconda3? Windows 10 系统下 Neovim 安装与配置

2025-11-20 · via jdhao's digital space

In this post, I compare the object vs nested type used in data mapping in Elasticsearch.

By default, if you have a field where value is a list of dictionary type itself, the field is indexes by Elastic as object type. The structure of each dict under the field is not preserved.

Let’s have a concrete example:

DELETE new_index
PUT new_index/_doc/1
{
  "name": [
    {
      "first": "alice",
      "last": "smith"
    },
    {
      "first": "john",
      "last": "white"
    }
  ]
}

Internally, the document is flattened to something like this:

{
  "name.first": ["alice", "john"],
  "name.last": ["smith", "white"]
}

To verify this, let’s add a second document:

PUT new_index/_doc/2
{
  "name": [
    {
      "first": "alice",
      "last": "white"
    },
    {
      "first": "john",
      "last": "smith"
    }
  ]
}

Then we search the index to find document where “name.first” is alice, and “name.last” is white:

GET new_index/_search
{
  "query": {
    "bool": {
      "must": [
        {
          "term": {
            "name.first": "alice"
          }
        },
        {
          "term": {
            "name.last": "white"
          }
        }
      ]
    }
  }
}

You would expect that document with id=2 is returned, however, both document 1 and 2 are returned.

nested type mapping#

In order to correctly preserve structure of inner dictionary under the field, we need to define the “name” field as nested type. In this case, we need to explicitly setting the mapping for the “name” field before adding documents.

DELETE my_index

PUT my_index
{
  "mappings": {
    "properties": {
      "name": {
        "type": "nested"
      },
      "attribute": {
        "type": "nested",
        "properties": {
          "name": {
            "type": "keyword"
          },
          "value": {
            "type": "keyword"
          }
        }
      }
    }
  }
}

Then let’s add two documents to this index:

PUT my_index/_doc/1
{
  "name": [
    {
      "first": "alice",
      "last": "smith"
    },
    {
      "first": "john",
      "last": "white"
    }
  ],
  "attribute": [
    {
      "name": "size",
      "value": "23"
    },
    {
      "name": "color",
      "value": "blue"
    }
  ]
}

PUT my_index/_doc/2
{
  "name": [
    {
      "first": "alice",
      "last": "white"
    },
    {
      "first": "john",
      "last": "smith"
    }
  ]
}

Now you can try to find documents where name.first is alice and name.last is white. Note that however, you need to use nested query instead of plain one above:

GET my_index/_search
{
  "query": {
    "nested": {
      "path": "name",
      "query": {
        "bool": {
          "must": [
            {
              "term": {
                "name.first": "alice"
              }
            },
            {
              "term": {
                "name.last": "white"
              }
            }
          ]
        }
      }
    }
  }
}

Now only document 2 is returned in the result.

nested vs object type#

If you define a field as nested type, internally each dict under this field is stored as a separate Lucene document. It is just on the surface, you see one document when you do the normal search.

In the output of cat-indices api, there is this docs.count field, which shows the number of Lucene documents that this index has. In the above example, field name in index new_index is object type, and we indexed 2 documents to this index. If you run the cat-indices api for new_index, you see docs.count is 2.

GET _cat/indices/new_index?v

health status index     uuid                   pri rep docs.count docs.deleted store.size pri.store.size dataset.size
yellow open   new_index dtWNUHroQ_OQkIRbQj8Bvw   1   1          2            0     10.9kb         10.9kb       10.9kb

For index my_index, both field name and attribute is defined as nested type, we indexed 2 documents to this index. The cat-index API shows that docs.stat is 8.

If you are interested in only the number of documents you indexed to an index, you can use the get-count API.

GET my_index/_count
GET new_index/_count

此内容由惯性聚合(RSS阅读器)自动聚合整理，仅供阅读参考。原文来自 — 版权归原作者所有。

推荐订阅源

jdhao's digital space

nested type mapping#

nested vs object type#