快捷導(dǎo)航

elasticsearch索引index之Mapping實(shí)現(xiàn)關(guān)系結(jié)構(gòu)示例

更新時(shí)間：2022年04月22日 10:31:42 作者：zziawan

這篇文章主要介紹了elasticsearch索引index之Mapping實(shí)現(xiàn)關(guān)系結(jié)構(gòu)示例，有需要的朋友可以借鑒參考下，希望能夠有所幫助，祝大家多多進(jìn)步，早日升職加薪

Mapping的實(shí)現(xiàn)關(guān)系結(jié)構(gòu)

Lucene索引的一個(gè)特點(diǎn)就filed，索引以field組合。這一特點(diǎn)為索引和搜索提供了很大的靈活性。elasticsearch則在Lucene的基礎(chǔ)上更近一步，它可以是 no scheme。實(shí)現(xiàn)這一功能的秘密就Mapping。Mapping是對(duì)索引各個(gè)字段的一種預(yù)設(shè)，包括索引與分詞方式，是否存儲(chǔ)等，數(shù)據(jù)根據(jù)字段名在Mapping中找到對(duì)應(yīng)的配置，建立索引。這里將對(duì)Mapping的實(shí)現(xiàn)結(jié)構(gòu)簡(jiǎn)單分析，Mapping的放置、更新、應(yīng)用會(huì)在后面的索引fenx中進(jìn)行說(shuō)明。

這只是Mapping中的一部分內(nèi)容。Mapping擴(kuò)展了lucene的filed，定義了更多的field類型既有Lucene所擁有的string，number等字段又有date，IP，byte及geo的相關(guān)字段，這也是es的強(qiáng)大之處。如上圖所示，可以分為兩類，mapper與documentmapper，前者是所有mapper的父接口。而DocumentMapper則是Mapper的集合，它代表了一個(gè)索引的mapper定義。

Mapper的三類

第一類就是核心field結(jié)構(gòu)FileMapper—>AbstractFieldMapper—>StringField這種核心數(shù)據(jù)類型，它代表了一類數(shù)據(jù)類型，如字符串類型，int類型這種；

第二類是Mapper—>ObjectMapper—>RootObjectMapper,object類型的Mapper，這也是elasticsearch對(duì)lucene的一大改進(jìn)，不想lucene之支持基本數(shù)據(jù)類型；

最后一類是Mapper—>RootMapper—>IndexFieldMapper這種類型，只存在于根Mapper中的一種Mapper，如IdFieldMapper及圖上的IndexFieldMapper，它們類似于index的元數(shù)據(jù)，只可能存在于某個(gè)index內(nèi)部。

parse方法

Mapper中一個(gè)比較重要的方法就是parse(ParseContext context)，Mapper的子類對(duì)這個(gè)方法都有各自的實(shí)現(xiàn)。它的主要功能是通過(guò)解析ParseContext獲取到對(duì)應(yīng)的field，這個(gè)方法主要用于建立索引時(shí)。索引數(shù)據(jù)被繼續(xù)成parsecontext，每個(gè)field解析parseContext構(gòu)建對(duì)應(yīng)的lucene Field。它在AbstractFieldMapper中的實(shí)現(xiàn)如下所示：

public void parse(ParseContext context) throws IOException {
        final List&lt;Field&gt; fields = new ArrayList&lt;&gt;(2);
        try {
            parseCreateField(context, fields);//實(shí)際Filed解析方法
            for (Field field : fields) {
                if (!customBoost()) {//設(shè)置boost
                    field.setBoost(boost);
                }
                if (context.listener().beforeFieldAdded(this, field, context)) {
                    context.doc().add(field);//將解析完成的Field加入到context中
                }
            }
        } catch (Exception e) {
            throw new MapperParsingException("failed to parse [" + names.fullName() + "]", e);
        }
        multiFields.parse(this, context);//進(jìn)行mutiFields解析，MultiFields作用是對(duì)同一個(gè)field做不同的定義，如可以進(jìn)行不同分詞方式的索引這樣便于通過(guò)各種方式查詢
        if (copyTo != null) {
            copyTo.parse(context);
        }
    }

這里的parseCreateField是一個(gè)抽象方法，每種數(shù)據(jù)類型都有自己的實(shí)現(xiàn)，如string的實(shí)現(xiàn)方式如下所示：

protected void parseCreateField(ParseContext context, List&lt;Field&gt; fields) throws IOException {
        ValueAndBoost valueAndBoost = parseCreateFieldForString(context, nullValue, boost);//解析成值和boost
        if (valueAndBoost.value() == null) {
            return;
        }
        if (ignoreAbove &gt; 0 &amp;&amp; valueAndBoost.value().length() &gt; ignoreAbove) {
            return;
        }
        if (context.includeInAll(includeInAll, this)) {
            context.allEntries().addText(names.fullName(), valueAndBoost.value(), valueAndBoost.boost());
        }
        if (fieldType.indexed() || fieldType.stored()) {//構(gòu)建LuceneField
            Field field = new Field(names.indexName(), valueAndBoost.value(), fieldType);
            field.setBoost(valueAndBoost.boost());
            fields.add(field);
        }
        if (hasDocValues()) {
            fields.add(new SortedSetDocValuesField(names.indexName(), new BytesRef(valueAndBoost.value())));
        }
        if (fields.isEmpty()) {
            context.ignoredValue(names.indexName(), valueAndBoost.value());
        }
    }
//解析出字段的值和boost
    public static ValueAndBoost parseCreateFieldForString(ParseContext context, String nullValue, float defaultBoost) throws IOException {
        if (context.externalValueSet()) {
            return new ValueAndBoost((String) context.externalValue(), defaultBoost);
        }
        XContentParser parser = context.parser();
        if (parser.currentToken() == XContentParser.Token.VALUE_NULL) {
            return new ValueAndBoost(nullValue, defaultBoost);
        }
        if (parser.currentToken() == XContentParser.Token.START_OBJECT) {
            XContentParser.Token token;
            String currentFieldName = null;
            String value = nullValue;
            float boost = defaultBoost;
            while ((token = parser.nextToken()) != XContentParser.Token.END_OBJECT) {
                if (token == XContentParser.Token.FIELD_NAME) {
                    currentFieldName = parser.currentName();
                } else {
                    if ("value".equals(currentFieldName) || "_value".equals(currentFieldName)) {
                        value = parser.textOrNull();
                    } else if ("boost".equals(currentFieldName) || "_boost".equals(currentFieldName)) {
                        boost = parser.floatValue();
                    } else {
                        throw new ElasticsearchIllegalArgumentException("unknown property [" + currentFieldName + "]");
                    }
                }
            }
            return new ValueAndBoost(value, boost);
        }
        return new ValueAndBoost(parser.textOrNull(), defaultBoost);
    }

以上就是Mapper如何將一個(gè)值解析成對(duì)應(yīng)的Field的過(guò)程，這里只是簡(jiǎn)單介紹，后面會(huì)有詳細(xì)分析。

部分Field

DocumentMapper是一個(gè)索引所有Mapper的集合，它表述了一個(gè)索引所有field的定義，可以說(shuō)是lucene的Document的定義，同時(shí)它還包含以下index的默認(rèn)值，如index和search時(shí)默認(rèn)分詞器。它的部分Field如下所示：

private final DocumentMapperParser docMapperParser;
    private volatile ImmutableMap&lt;String, Object&gt; meta;
    private volatile CompressedString mappingSource;
    private final RootObjectMapper rootObjectMapper;
    private final ImmutableMap&lt;Class&lt;? extends RootMapper&gt;, RootMapper&gt; rootMappers;
    private final RootMapper[] rootMappersOrdered;
    private final RootMapper[] rootMappersNotIncludedInObject;
    private final NamedAnalyzer indexAnalyzer;
    private final NamedAnalyzer searchAnalyzer;
    private final NamedAnalyzer searchQuoteAnalyzer;

DocumentMapper的功能也體現(xiàn)在parse方法上，它的作用是解析整條數(shù)據(jù)。之前在Mapper中看到了Field是如何解析出來(lái)的，那其實(shí)是在DocumentMapper解析之后。index請(qǐng)求發(fā)過(guò)來(lái)的整條數(shù)據(jù)在這里被解析出Field，查找Mapping中對(duì)應(yīng)的Field設(shè)置，交給它去解析。如果沒(méi)有且運(yùn)行動(dòng)態(tài)添加，es則會(huì)根據(jù)值自動(dòng)創(chuàng)建一個(gè)Field同時(shí)更新Mapping。方法代碼如下所示：

public ParsedDocument parse(SourceToParse source, @Nullable ParseListener listener) throws MapperParsingException {
        ParseContext.InternalParseContext context = cache.get();
        if (source.type() != null &amp;&amp; !source.type().equals(this.type)) {
            throw new MapperParsingException("Type mismatch, provide type [" + source.type() + "] but mapper is of type [" + this.type + "]");
        }
        source.type(this.type);
        XContentParser parser = source.parser();
        try {
            if (parser == null) {
                parser = XContentHelper.createParser(source.source());
            }
            if (sourceTransforms != null) {
                parser = transform(parser);
            }
            context.reset(parser, new ParseContext.Document(), source, listener);
            // will result in START_OBJECT
            int countDownTokens = 0;
            XContentParser.Token token = parser.nextToken();
            if (token != XContentParser.Token.START_OBJECT) {
                throw new MapperParsingException("Malformed content, must start with an object");
            }
            boolean emptyDoc = false;
            token = parser.nextToken();
            if (token == XContentParser.Token.END_OBJECT) {
                // empty doc, we can handle it...
                emptyDoc = true;
            } else if (token != XContentParser.Token.FIELD_NAME) {
                throw new MapperParsingException("Malformed content, after first object, either the type field or the actual properties should exist");
            }
            // first field is the same as the type, this might be because the
            // type is provided, and the object exists within it or because
            // there is a valid field that by chance is named as the type.
            // Because of this, by default wrapping a document in a type is
            // disabled, but can be enabled by setting
            // index.mapping.allow_type_wrapper to true
            if (type.equals(parser.currentName()) &amp;&amp; indexSettings.getAsBoolean(ALLOW_TYPE_WRAPPER, false)) {
                parser.nextToken();
                countDownTokens++;
            }
            for (RootMapper rootMapper : rootMappersOrdered) {
                rootMapper.preParse(context);
            }
            if (!emptyDoc) {
                rootObjectMapper.parse(context);
            }
            for (int i = 0; i &lt; countDownTokens; i++) {
                parser.nextToken();
            }
            for (RootMapper rootMapper : rootMappersOrdered) {
                rootMapper.postParse(context);
            }
        } catch (Throwable e) {
            // if its already a mapper parsing exception, no need to wrap it...
            if (e instanceof MapperParsingException) {
                throw (MapperParsingException) e;
            }
            // Throw a more meaningful message if the document is empty.
            if (source.source() != null &amp;&amp; source.source().length() == 0) {
                throw new MapperParsingException("failed to parse, document is empty");
            }
            throw new MapperParsingException("failed to parse", e);
        } finally {
            // only close the parser when its not provided externally
            if (source.parser() == null &amp;&amp; parser != null) {
                parser.close();
            }
        }
        // reverse the order of docs for nested docs support, parent should be last
        if (context.docs().size() &gt; 1) {
            Collections.reverse(context.docs());
        }
        // apply doc boost
        if (context.docBoost() != 1.0f) {
            Set&lt;String&gt; encounteredFields = Sets.newHashSet();
            for (ParseContext.Document doc : context.docs()) {
                encounteredFields.clear();
                for (IndexableField field : doc) {
                    if (field.fieldType().indexed() &amp;&amp; !field.fieldType().omitNorms()) {
                        if (!encounteredFields.contains(field.name())) {
                            ((Field) field).setBoost(context.docBoost() * field.boost());
                            encounteredFields.add(field.name());
                        }
                    }
                }
            }
        }
        ParsedDocument doc = new ParsedDocument(context.uid(), context.version(), context.id(), context.type(), source.routing(), source.timestamp(), source.ttl(), context.docs(), context.analyzer(),
                context.source(), context.mappingsModified()).parent(source.parent());
        // reset the context to free up memory
        context.reset(null, null, null, null);
        return doc;
    }

將整條數(shù)據(jù)解析成ParsedDocument，解析后的數(shù)據(jù)才能進(jìn)行后面的Field解析建立索引。

總結(jié)

以上就是Mapping的結(jié)構(gòu)和相關(guān)功能概括，Mapper賦予了elasticsearch索引的更強(qiáng)大功能，使得索引和搜索可以支持更多數(shù)據(jù)類型，靈活性更高，更多關(guān)于elasticsearch索引index Mapping關(guān)系結(jié)構(gòu)的資料請(qǐng)關(guān)注腳本之家其它相關(guān)文章！

您可能感興趣的文章: