连接 | Manticore Search Manual

当你通过SQL在MySQL协议下运行查询时，你将收到请求的列作为结果，或者如果未找到任何内容，则为空的结果集。

‹›

SQL
JSON

📋

SELECT * FROM tbl;

‹›

Response

+------+------+--------+
| id   | age  | name   |
+------+------+--------+
|    1 |   25 | joe    |
|    2 |   25 | mary   |
|    3 |   33 | albert |
+------+------+--------+
3 rows in set (0.00 sec)

此外，你还可以使用 SHOW META 调用来查看最新查询的额外元信息。

‹›

SQL
JSON

📋

SELECT id,story_author,comment_author FROM hn_small WHERE story_author='joe' LIMIT 3; SHOW META;

‹›

Response

++--------+--------------+----------------+
| id     | story_author | comment_author |
+--------+--------------+----------------+
| 152841 | joe          | SwellJoe       |
| 161323 | joe          | samb           |
| 163735 | joe          | jsjenkins168   |
+--------+--------------+----------------+
3 rows in set (0.01 sec)
+----------------+-------+
| Variable_name  | Value |
+----------------+-------+
| total          | 3     |
| total_found    | 20    |
| total_relation | gte   |
| time           | 0.010 |
+----------------+-------+
4 rows in set (0.00 sec)

在某些情况下，例如执行分面搜索时，你可能会收到多个结果集作为对SQL查询的响应。

‹›

SQL
JSON

📋

SELECT * FROM tbl WHERE MATCH('joe') FACET age;

‹›

Response

+------+------+
| id   | age  |
+------+------+
|    1 |   25 |
+------+------+
1 row in set (0.00 sec)
+------+----------+
| age  | count(*) |
+------+----------+
|   25 |        1 |
+------+----------+
1 row in set (0.00 sec)

‹›

如果出现警告，结果集将包含警告标志，你可以使用 SHOW WARNINGS 查看警告。

‹›

SQL
JSON

📋

SELECT * from tbl where match('"joe"/3'); show warnings;

‹›

Response

+------+------+------+
| id   | age  | name |
+------+------+------+
|    1 |   25 | joe  |
+------+------+------+
1 row in set, 1 warning (0.00 sec)
+---------+------+--------------------------------------------------------------------------------------------+
| Level   | Code | Message                                                                                    |
+---------+------+--------------------------------------------------------------------------------------------+
| warning | 1000 | quorum threshold too high (words=1, thresh=3); replacing quorum operator with AND operator |
+---------+------+--------------------------------------------------------------------------------------------+
1 row in set (0.00 sec)

如果查询失败，你将收到一个错误：

‹›

SQL
JSON

📋

SELECT * from tbl where match('@surname joe');

‹›

Response

ERROR 1064 (42000): index idx: query error: no field 'surname' found in schema

通过HTTP JSON接口，查询结果将以JSON文档的形式发送。示例：

{
  "took":10,
  "timed_out": false,
  "hits":
  {
    "total": 2,
    "hits":
    [
      {
        "_id": 1,
        "_score": 1,
        "_source": { "gid": 11 }
      },
      {
        "_id": 2,
        "_score": 1,
        "_source": { "gid": 12 }
      }
    ]
  }
}

took：执行搜索所花费的毫秒数
timed_out：查询是否超时
hits：搜索结果，具有以下属性：
- total：匹配文档的总数
- hits：包含匹配项的数组

查询结果还可以包括查询剖析信息。请参阅查询剖析。

hits数组中的每个匹配项具有以下属性：

_id：匹配ID
_score：匹配权重，由排名器计算
_source：包含此匹配项属性的数组

默认情况下，所有属性都会返回在 _source 数组中。你可以在请求负载中使用 _source 属性来选择你想要包含在结果集中的字段。示例：

{
  "table":"test",
  "_source":"attr*",
  "query": { "match_all": {} }
}

你可以指定你想要包含在查询结果中的属性作为字符串（"_source": "attr*"）或作为字符串数组（"_source": [ "attr1", "attri*" ]"）。每个条目可以是属性名或通配符（*，% 和 ? 符号受支持）。

你也可以显式指定你想要包含和排除的属性，使用 includes 和 excludes 属性：

"_source":
{
  "includes": [ "attr1", "attri*" ],
  "excludes": [ "*desc*" ]
}

空的 includes 列表被解释为“包含所有属性”，而空的 excludes 列表不匹配任何内容。如果一个属性同时匹配 includes 和 excludes，则 excludes 会获胜。

过滤器

Last modified: April 13, 2026

WHERE 是一个 SQL 子句，适用于全文匹配和额外的过滤。以下操作符可用：

比较操作符 <, >, <=, >=, =, <>, BETWEEN, IN, IS NULL
布尔操作符 AND, OR, NOT

MATCH('query') 受支持，并映射到一个全文查询。

{col_name | expr_alias} [NOT] IN @uservar 条件语法受支持。请参阅 SET 语法以了解全局用户变量的描述。

如果您更喜欢 HTTP JSON 接口，也可以应用过滤。它可能看起来比 SQL 更复杂，但在需要程序化准备查询的情况下（例如，当用户在您的应用程序中填写表单时），它被推荐使用。

以下是一个 bool 查询中几个过滤器的示例。

此全文查询匹配所有包含 product 的字段的文档。这些文档必须具有价格大于或等于 500（gte）且小于或等于 1000（lte）。所有这些文档都必须没有小于 15 的修订版本（lt）。

‹›

JSON

JSON

📋

⚙

POST /search
{
  "table": "test1",
  "query": {
    "bool": {
      "must": [
        { "match" : { "_all" : "product" } },
        { "range": { "price": { "gte": 500, "lte": 1000 } } }
      ],
      "must_not": {
        "range": { "revision": { "lt": 15 } }
      }
    }
  }
}

bool 查询根据其他查询和/或过滤器的布尔组合匹配文档。查询和过滤器必须在 must、should 或 must_not 部分指定，并且可以嵌套。

‹›

JSON

JSON

📋

⚙

POST /search
{
  "table":"test1",
  "query": {
    "bool": {
      "must": [
        { "match": {"_all":"keyword"} },
        { "range": { "revision": { "gte": 14 } } }
      ]
    }
  }
}

must 部分中指定的查询和过滤器必须匹配文档。如果指定了多个全文查询或过滤器，则所有这些查询都必须匹配。这相当于 SQL 中的 AND 查询。请注意，如果您要匹配数组（多值属性），可以多次指定该属性。结果仅在数组中找到所有查询值时为正，例如：

"must": [
  {"equals" : { "product_codes": 5 }},
  {"equals" : { "product_codes": 6 }}
]

另外，从性能角度来看，使用：

  {"in" : { "all(product_codes)": [5,6] }}

（请参阅下方的详细信息）可能更好。

should 部分中指定的查询和过滤器应该匹配文档。如果在 must 或 must_not 中指定了某些查询，则 should 查询将被忽略。另一方面，如果除了 should 之外没有其他查询，则至少有一个这些查询必须匹配一个文档，该文档才能匹配 bool 查询。这相当于 OR 查询。请注意，如果您要匹配数组（多值属性），可以多次指定该属性，例如：

"should": [
  {"equals" : { "product_codes": 7 }},
  {"equals" : { "product_codes": 8 }}
]

另外，从性能角度来看，使用：

  {"in" : { "any(product_codes)": [7,8] }}

（请参阅下方的详细信息）可能更好。

must_not 部分中指定的查询和过滤器必须不匹配文档。如果在 must_not 下指定了多个查询，则文档匹配如果它们中的任何一个都不匹配。

‹›

JSON

JSON

📋

⚙

POST /search
{
  "table":"t",
  "query": {
    "bool": {
      "should": [
        {
          "equals": {
            "b": 1
          }
        },
        {
          "equals": {
            "b": 3
          }
        }
      ],
      "must": [
        {
          "equals": {
            "a": 1
          }
        }
      ],
      "must_not": {
        "equals": {
          "b": 2
        }
      }
    }
  }
}

bool 查询可以嵌套在另一个 bool 中，因此您可以创建更复杂的查询。要创建嵌套布尔查询，请使用另一个 bool 而不是 must、should 或 must_not。以下是如何表示此查询：

a = 2 and (a = 10 or b = 0)

以 JSON 格式呈现。

‹›

JSON

JSON

📋

⚙

a = 2 and (a = 10 or b = 0)

POST /search
{
  "table":"t",
  "query": {
    "bool": {
      "must": [
        {
          "equals": {
            "a": 2
          }
        },
        {
          "bool": {
            "should": [
              {
                "equals": {
                  "a": 10
                }
              },
              {
                "equals": {
                  "b": 0
                }
              }
            ]
          }
        }
      ]
    }
  }
}

更复杂的查询：

(a = 1 and b = 1) or (a = 10 and b = 2) or (b = 0)

‹›

JSON

JSON

📋

⚙

(a = 1 and b = 1) or (a = 10 and b = 2) or (b = 0)

POST /search
{
  "table":"t",
  "query": {
    "bool": {
      "should": [
        {
          "bool": {
            "must": [
              {
                "equals": {
                  "a": 1
                }
              },
              {
                "equals": {
                  "b": 1
                }
              }
            ]
          }
        },
        {
          "bool": {
            "must": [
              {
                "equals": {
                  "a": 10
                }
              },
              {
                "equals": {
                  "b": 2
                }
              }
            ]
          }
        },
        {
          "bool": {
            "must": [
              {
                "equals": {
                  "b": 0
                }
              }
            ]
          }
        }
      ]
    }
  }
}

SQL 格式的查询（query_string）也可以在 bool 查询中使用。

‹›

JSON

JSON

📋

⚙

POST /search
{
  "table": "test1",
  "query": {
    "bool": {
      "must": [
        { "query_string" : "product" },
        { "query_string" : "good" }
      ]
    }
  }
}

等值过滤器是最简单的过滤器，适用于整数、浮点数和字符串属性。

‹›

JSON

JSON

📋

⚙

POST /search
{
  "table":"test1",
  "query": {
    "equals": { "price": 500 }
  }
}

equals 过滤器可以应用于多值属性，您可以使用：

any()，如果属性至少有一个值等于查询值，则为正；
all()，如果属性只有一个值且等于查询值，则为正

‹›

JSON

JSON

📋

⚙

POST /search
{
  "table":"test1",
  "query": {
    "equals": { "any(price)": 100 }
  }
}

集合过滤器检查属性值是否等于指定集合中的任何值。

集合过滤器支持整数、字符串和多值属性。

‹›

JSON

JSON

📋

⚙

POST /search
{
  "table":"test1",
  "query": {
    "in": {
      "price": [1,10,100]
    }
  }
}

当应用于多值属性时，您可以使用：

any()（等同于无函数），如果属性值与查询值之间至少有一个匹配，则为正；
all()，如果所有属性值都在查询集中，则为正

‹›

JSON

JSON

📋

⚙

POST /search
{
  "table":"test1",
  "query": {
    "in": {
      "all(price)": [1,10]
    }
  }
}

范围过滤器匹配具有属性值在指定范围内的文档。

范围过滤器支持以下属性：

gte：大于或等于
gt：大于
lte：小于或等于
lt：小于

‹›

JSON

JSON

📋

⚙

POST /search
{
  "table":"test1",
  "query": {
    "range": {
      "price": {
        "gte": 500,
        "lte": 1000
      }
    }
  }
}

geo_distance 过滤器用于过滤距离特定地理位置特定距离内的文档。

指定针的位置，以度为单位。距离从这一点计算。

指定包含纬度和经度的属性。

指定距离计算函数。可以是 adaptive 或 haversine。adaptive 更快且更精确，有关详细信息，请参阅 GEODIST()。可选，默认为 adaptive。

指定从针位置的最大距离。在此距离内的所有文档都匹配。距离可以使用多种单位指定。如果没有指定单位，则距离默认为米。以下是支持的距离单位列表：

米: m 或 meters
公里: km 或 kilometers
厘米: cm 或 centimeters
毫米: mm 或 millimeters
英里: mi 或 miles
码: yd 或 yards
英尺: ft 或 feet
英寸: in 或 inch
海里: NM, nmi 或 nauticalmiles

location_anchor 和 location_source 属性接受以下纬度/经度格式：

具有 lat 和 lon 键的对象: { "lat": "attr_lat", "lon": "attr_lon" }
具有以下结构的字符串: "attr_lat, attr_lon"
按以下顺序排列的数组: [attr_lon, attr_lat]

纬度和经度以度为单位指定。

‹›

Basic example
Advanced example

📋

POST /search
{
  "table":"test",
  "query": {
    "geo_distance": {
      "location_anchor": {"lat":49, "lon":15},
      "location_source": {"attr_lat, attr_lon"},
      "distance_type": "adaptive",
      "distance":"100 km"
    }
  }
}

搜索结果连接

Last modified: August 28, 2025

表连接在Manticore Search中允许您通过匹配相关列将两个表中的文档合并。此功能允许执行更复杂的查询并增强跨多个表的数据检索。

SELECT
    select_expr [, select_expr] ...
    FROM tbl_name
    {INNER | LEFT} JOIN tbl2_name
    ON join_condition
    [...other select options]
join_condition: {
    left_table.attr = right_table.attr
    | left_table.json_attr.string_id = string(right_table.json_attr.string_id)
    | left_table.json_attr.int_id = int(right_table.json_attr.int_id)
    | [..filters on right table attributes]
}

有关选择选项的更多信息，请参阅SELECT部分。

当通过JSON属性的值进行连接时，您需要显式指定该值的类型，使用int()或string()函数。

SELECT ... ON left_table.json_attr.string_id = string(right_table.json_attr.string_id)

SELECT ... ON left_table.json_attr.int_id = int(right_table.json_attr.int_id)

POST /search
{
  "table": "table_name",
  "query": {
    <optional full-text query against the left table>
  },
  "join": [
    {
      "type": "inner" | "left",
      "table": "joined_table_name",
      "query": {
        <optional full-text query against the right table>
      },
      "on": [
        {
          "left": {
            "table": "left_table_name",
            "field": "field_name",
            "type": "<common field's type when joining using json attributes>"
          },
          "operator": "eq",
          "right": {
            "table": "right_table_name",
            "field": "field_name"
          }
        }
      ]
    }
  ],
  "options": {
    ...
  }
}
on.type: {
    int
    | string
}

注意，在left操作数部分有type字段，您应该在使用json属性连接两个表时使用它。允许的值为string和int。

Manticore Search 支持两种类型的连接：

INNER JOIN：仅返回在两个表中都有匹配的行。例如，查询在orders和customers表之间执行INNER JOIN，仅包括匹配的订单。

‹›

SQL
JSON

📋

SELECT product, customers.email, customers.name, customers.address
FROM orders
INNER JOIN customers
ON customers.id = orders.customer_id
WHERE MATCH('maple', customers)
ORDER BY customers.email ASC;

‹›

Response

+---------+-------------------+----------------+-------------------+
| product | customers.email   | customers.name | customers.address |
+---------+-------------------+----------------+-------------------+
| Laptop  | alice@example.com | Alice Johnson  | 123 Maple St      |
| Tablet  | alice@example.com | Alice Johnson  | 123 Maple St      |
+---------+-------------------+----------------+-------------------+
2 rows in set (0.00 sec)

‹›

LEFT JOIN：返回左表的所有行以及右表的匹配行。如果没有匹配，则右表的列返回NULL值。例如，此查询使用LEFT JOIN检索所有客户及其相应的订单。如果没有相应的订单，则会出现NULL值。结果按客户的电子邮件排序，并仅选择客户的姓名和订单数量。

‹›

SQL
JSON

📋

SELECT
name, orders.quantity
FROM customers
LEFT JOIN orders
ON orders.customer_id = customers.id
ORDER BY email ASC;

‹›

Response

+---------------+-----------------+-------------------+
| name          | orders.quantity | @int_attr_email   |
+---------------+-----------------+-------------------+
| Alice Johnson |               1 | alice@example.com |
| Alice Johnson |               1 | alice@example.com |
| Bob Smith     |               2 | bob@example.com   |
| Carol White   |               1 | carol@example.com |
| John Smith    |            NULL | john@example.com  |
+---------------+-----------------+-------------------+
5 rows in set (0.00 sec)

{
  "took": 0,
  "timed_out": false,
  "hits": {
    "total": 5,
    "total_relation": "eq",
    "hits": [
      {
        "_id": 1,
        "_score": 1,
        "_source": {
          "name": "Alice Johnson",
          "address": "123 Maple St",
          "email": "alice@example.com",
          "orders.id": 3,
          "orders.customer_id": 1,
          "orders.quantity": 1,
          "orders.order_date": "2023-01-03",
          "orders.tags": [
            101,
            104
          ],
          "orders.details": {
            "price": 450,
            "warranty": "1 year"
          },
          "orders.product": "Tablet"
        }
      },
      {
        "_id": 1,
        "_score": 1,
        "_source": {
          "name": "Alice Johnson",
          "address": "123 Maple St",
          "email": "alice@example.com",
          "orders.id": 1,
          "orders.customer_id": 1,
          "orders.quantity": 1,
          "orders.order_date": "2023-01-01",
          "orders.tags": [
            101,
            102
          ],
          "orders.details": {
            "price": 1200,
            "warranty": "2 years"
          },
          "orders.product": "Laptop"
        }
      },
      {
        "_id": 2,
        "_score": 1,
        "_source": {
          "name": "Bob Smith",
          "address": "456 Oak St",
          "email": "bob@example.com",
          "orders.id": 2,
          "orders.customer_id": 2,
          "orders.quantity": 2,
          "orders.order_date": "2023-01-02",
          "orders.tags": [
            103
          ],
          "orders.details": {
            "price": 800,
            "warranty": "1 year"
          },
          "orders.product": "Phone"
        }
      },
      {
        "_id": 3,
        "_score": 1,
        "_source": {
          "name": "Carol White",
          "address": "789 Pine St",
          "email": "carol@example.com",
          "orders.id": 4,
          "orders.customer_id": 3,
          "orders.quantity": 1,
          "orders.order_date": "2023-01-04",
          "orders.tags": [
            105
          ],
          "orders.details": {
            "price": 300,
            "warranty": "1 year"
          },
          "orders.product": "Monitor"
        }
      },
      {
        "_id": 4,
        "_score": 1,
        "_source": {
          "name": "John Smith",
          "address": "15 Barclays St",
          "email": "john@example.com",
          "orders.id": 0,
          "orders.customer_id": 0,
          "orders.quantity": 0,
          "orders.order_date": "",
          "orders.tags": [],
          "orders.details": null,
          "orders.product": ""
        }
      }
    ]
  }
}

Manticore Search 中表连接的一个强大功能是能够在连接的两个表中同时执行全文搜索。这允许您创建基于多个表中的文本内容的复杂查询。

您可以为 JOIN 查询中的每个表单独使用 MATCH() 函数。查询根据两个表中的文本内容过滤结果。

‹›

SQL
JSON

📋

SELECT t1.f, t2.f 
FROM t1 
LEFT JOIN t2 ON t1.id = t2.id 
WHERE MATCH('hello', t1) AND MATCH('goodbye', t2);

‹›

Response

+-------------+---------------+
| f           | t2.f          |
+-------------+---------------+
| hello world | goodbye world |
+-------------+---------------+
1 row in set (0.00 sec)

‹›

在JSON API查询中，表特定的全文匹配与SQL不同：

主表查询：根级别上的 "query" 字段应用于主表（在 "table" 中指定）。

连接表查询：每个连接定义可以包括其自己的 "query" 字段，专门应用于该连接表。

‹›

JSON

JSON

📋

⚙

POST /search
{
  "table": "t1",
  "query": {
    "query_string": "hello"
  },
  "join": [
    {
      "type": "left",
      "table": "t2",
      "query": {
        "match": {
          "*": "goodbye"
        }
      },
      "on": [
        {
          "left": {
            "table": "t1",
            "field": "id"
          },
          "operator": "eq",
          "right": {
            "table": "t2",
            "field": "id"
          }
        }
      ]
    }
  ]
}

‹›

Response

{
  "took": 1,
  "timed_out": false,
  "hits": {
    "total": 1,
    "total_relation": "eq",
    "hits": [
      {
        "_id": 1,
        "_score": 1680,
        "t2._score": 1680,
        "_source": {
          "f": "hello world",
          "t2.id": 1,
          "t2.f": "goodbye world"
        }
      }
    ]
  }
}

1. 仅在主表上查询：返回主表中所有匹配的行。对于未匹配的连接记录（LEFT JOIN），SQL返回NULL值，而JSON API返回默认值（数字为0，文本为空字符串）。

‹›

SQL
JSON

📋

SELECT * FROM t1 
LEFT JOIN t2 ON t1.id = t2.id 
WHERE MATCH('database', t1);

‹›

Response

+------+-----------------+-------+------+
| id   | f               | t2.id | t2.f |
+------+-----------------+-------+------+
|    3 | database search |  NULL | NULL |
+------+-----------------+-------+------+
1 row in set (0.00 sec)

2. 连接表上的查询作为过滤器：当连接表有查询时，仅返回同时满足连接条件和查询条件的记录。

‹›

JSON

JSON

📋

⚙

POST /search
{
  "table": "t1",
  "query": {
    "query_string": "database"
  },
  "join": [
    {
      "type": "left",
      "table": "t2",
      "query": {
        "query_string": "nonexistent"
      },
      "on": [
        {
          "left": {
            "table": "t1",
            "field": "id"
          },
          "operator": "eq",
          "right": {
            "table": "t2",
            "field": "id"
          }
        }
      ]
    }
  ]
}

‹›

Response

{
  "took": 0,
  "timed_out": false,
  "hits": {
    "total": 0,
    "total_relation": "eq",
    "hits": []
  }
}

3. 连接类型影响过滤：INNER JOIN 要求同时满足连接和查询条件，而LEFT JOIN即使右表条件失败也会返回匹配的左表行。

在使用连接中的全文匹配时，请注意以下几点：

表特定匹配：
- SQL：每个 MATCH() 函数应指定要搜索的表：MATCH('term', table_name)
- JSON：使用根级的 "query" 用于主表，并在每个连接定义中的 "query" 用于连接表
查询语法灵活性：JSON API 支持 "query_string" 和 "match" 两种全文查询语法
性能影响：在两个表上执行全文匹配可能会影响查询性能，特别是在大数据集上。考虑使用适当的索引和批次大小。
NULL/默认值处理：使用LEFT JOIN时，如果右表中没有匹配记录，查询优化器将根据性能决定是否先评估全文条件还是过滤条件。SQL返回NULL值，而JSON API返回默认值（数字为0，文本为空字符串）。
过滤行为：连接表上的查询作为过滤器 - 它们限制结果为同时满足连接和查询条件的记录。
全文运算符支持：所有全文运算符都支持在连接查询中使用，包括短语、接近、字段搜索、NEAR、共识匹配和高级运算符。
评分计算：每个表都维护自己的相关性评分，可通过SQL中的 table_name.weight() 或JSON响应中的 table_name._score 访问。

在前面的示例基础上，让我们探索一个更高级的场景，其中我们将表连接与跨多个表的全文匹配和分面结合。这展示了Manticore连接功能的全部力量，包括复杂的过滤和聚合。

该查询演示了在 customers 和 orders 表上进行全文匹配，并结合范围过滤和分面功能。它搜索名为 "Alice" 或 "Bob" 的客户及其包含 "laptop"、"phone" 或 "tablet" 且价格高于 $500 的订单。结果按订单 ID 排序，并按保修条款进行分面。

‹›

SQL
JSON

📋

SELECT orders.product, name, orders.details.price, orders.tags
FROM customers
LEFT JOIN orders ON customers.id = orders.customer_id
WHERE orders.details.price > 500
AND MATCH('laptop | phone | tablet', orders)
AND MATCH('alice | bob', customers)
ORDER BY orders.id ASC
FACET orders.details.warranty;

‹›

Response

+-----------------+---------------+----------------------+-------------+
| orders.product  | name          | orders.details.price | orders.tags |
+-----------------+---------------+----------------------+-------------+
| Laptop Computer | Alice Johnson |                 1200 | 101,102     |
| Smart Phone     | Bob Smith     |                  800 | 103         |
+-----------------+---------------+----------------------+-------------+
2 rows in set (0.00 sec)
+-------------------------+----------+
| orders.details.warranty | count(*) |
+-------------------------+----------+
| 2 years                 |        1 |
| 1 year                  |        1 |
+-------------------------+----------+
2 rows in set (0.00 sec)

‹›

可以为连接查询中的左表和右表分别指定不同的选项。SQL 查询的语法是 OPTION(<table_name>)，JSON 查询则是在 "options" 下使用一个或多个子对象。

以下示例展示了如何为右表的全文查询指定不同的字段权重。要通过 SQL 获取匹配权重，请使用 <table_name>.weight() 表达式。在 JSON 查询中，此权重表示为 <table_name>._score。

‹›

SQL
JSON

📋

SELECT product, customers.email, customers.name, customers.address, customers.weight()
FROM orders
INNER JOIN customers
ON customers.id = orders.customer_id
WHERE MATCH('maple', customers)
OPTION(customers) field_weights=(address=1500);

‹›

Response

+---------+-------------------+----------------+-------------------+--------------------+
| product | customers.email   | customers.name | customers.address | customers.weight() |
+---------+-------------------+----------------+-------------------+--------------------+
| Laptop  | alice@example.com | Alice Johnson  | 123 Maple St      |            1500680 |
| Tablet  | alice@example.com | Alice Johnson  | 123 Maple St      |            1500680 |
+---------+-------------------+----------------+-------------------+--------------------+
2 rows in set (0.00 sec)

在执行表连接时，Manticore Search 会以批处理方式处理结果，以优化性能和资源使用。其工作原理如下：

批处理如何工作：
- 首先执行左表的查询，并将结果累积到一个批次中。
- 然后，该批次作为右表查询的输入，右表查询作为单个操作执行。
- 这种方法最大限度地减少了发送到右表的查询次数，提高了效率。
配置批次大小：
- 可以使用 join_batch_size 搜索选项调整批次的大小。
- 也可以在配置文件的 searchd 部分中配置此选项。
- 默认批次大小为 1000，但您可以根据您的用例增加或减少它。
- 设置 join_batch_size=0 将完全禁用批处理，这可能对调试或特定场景有用。
性能考虑：
- 较大的批次大小可以通过减少在右表上执行的查询次数来提高性能。
- 但是，较大的批次可能会消耗更多内存，特别是对于复杂查询或大型数据集。
- 尝试不同的批次大小，以找到性能和资源使用之间的最佳平衡点。

为了进一步优化连接操作，Manticore Search 对右表执行的查询采用了缓存机制。以下是您需要了解的内容：

缓存如何工作：
- 右表的每个查询都由 JOIN ON 条件定义。
- 如果相同的 JOIN ON 条件在多个查询中重复出现，结果将被缓存并重复使用。
- 这避免了冗余查询，并加快了后续连接操作的速度。
配置缓存大小：
- 连接缓存的大小可以通过配置文件 searchd 部分中的 join_cache_size 选项进行配置。
- 默认缓存大小为 20MB，但您可以根据工作负载和可用内存进行调整。
- 设置 join_cache_size=0 将完全禁用缓存。
内存考虑：
- 每个线程维护自己的缓存，因此总内存使用量取决于线程数量和缓存大小。
- 请确保您的服务器有足够的内存来容纳缓存，特别是在高并发环境中。

仅由本地表组成的分布式表在连接查询的左侧和右侧都受支持。但是，包含远程表的分布式表不受支持。

在 Manticore Search 中使用 JOIN 时，请记住以下几点：

字段选择：在 JOIN 中从两个表选择字段时，右表的字段需以表名作为前缀。左表的字段可以使用或不使用表名前缀。例如：
```
SELECT field_name, right_table.field_name FROM ...
-- or with left table prefix:
SELECT left_table.field_name, right_table.field_name FROM ...
```

JOIN 条件：始终在 JOIN 条件中明确指定表名：

JOIN ON table_name.some_field = another_table_name.some_field

使用 JOIN 的表达式：当使用组合了连接表中两个字段的表达式时，请为表达式的结果设置别名：
```
SELECT *, (nums2.n + 3) AS x, x * n FROM nums LEFT JOIN nums2 ON nums2.id = nums.num2_id
```
对带别名的表达式进行过滤：不能在 WHERE 子句中使用涉及两个表字段的表达式的别名。

JSON 属性：在 JSON 属性上进行连接时，必须将值显式转换为适当的类型：

-- Correct:
SELECT * FROM t1 LEFT JOIN t2 ON int(t1.json_attr.id) = t2.json_attr.id
-- Incorrect:
SELECT * FROM t1 LEFT JOIN t2 ON t1.json_attr.id = t2.json_attr.id

NULL 处理：可以在连接字段上使用 IS NULL 和 IS NOT NULL 条件：

SELECT * FROM t1 LEFT JOIN t2 ON t1.id = t2.id WHERE t2.name IS NULL
SELECT * FROM t1 LEFT JOIN t2 ON t1.id = t2.id WHERE t2.name IS NOT NULL

在 MVA 中使用 ANY：在 JOIN 中对多值属性使用 ANY() 函数时，请为连接表中的多值属性设置别名：
```
SELECT *, t2.m AS alias
FROM t
LEFT JOIN t2 ON t.id = t2.t_id
WHERE ANY(alias) IN (3, 5)
```

遵循这些指南，您可以有效地在 Manticore Search 中使用 JOIN 来组合来自多个索引的数据并执行复杂查询。

过滤器表达式

Last modified: April 13, 2026

Manticore 通过 SQL 和 HTTP 支持使用任意的算术表达式，结合属性值、内部属性（文档 ID 和相关度权重）、算术运算、多种内建函数以及用户自定义函数。以下是完整的参考列表，方便快速查阅。

+, -, *, /, %, DIV, MOD

提供标准算术运算符。使用这些运算符的算术计算可以以三种不同模式执行：

使用单精度、32 位 IEEE 754 浮点值（默认），
使用有符号 32 位整数，
使用有符号 64 位整数。

表达式解析器会自动切换到整数模式，如果没有任何运算结果为浮点值。否则，使用默认的浮点模式。例如，当两个参数都是 32 位整数时，a+b 会以 32 位整数计算；当两个参数都是整数但其中一个为 64 位时，使用 64 位整数；否则以浮点计算。然而，a/b 或 sqrt(a) 始终以浮点计算，因为这些运算返回非整数结果。为避免这种情况，可以使用 IDIV(a,b) 或 DIV b 形式。另外，当参数是 32 位时，a*b 不会自动提升到 64 位。若要强制使用 64 位结果，请使用 BIGINT()，但注意如果存在非整数运算，BIGINT() 将被忽略。

<, > <=, >=, =, <>

比较运算符返回 1.0 时条件为真，返回 0.0 时条件为假。例如，当属性 a 等于属性 b 时，表达式 (a=b)+3 计算值为 4；当 a 不等于 b 时，计算值为 3。与 MySQL 不同，等值比较（即 = 和 <> 运算符）包含一个小的相等阈值（默认是 1e-6）。如果比较值的差距在阈值内，则认为它们相等。

对于多值属性，BETWEEN 和 IN 运算符只要至少一个值满足条件就返回真（类似于 ANY()）。IN 运算符不支持 JSON 属性。IS (NOT) NULL 运算符仅支持 JSON 属性。

AND, OR, NOT

布尔运算符（AND、OR、NOT）的行为符合预期。它们是左结合的，并且优先级最低。NOT 的优先级高于 AND 和 OR，但仍低于其他运算符。AND 和 OR 具有相同优先级，建议在复杂表达式中使用括号以避免混淆。

&, |

这些运算符分别执行按位与和按位或。操作数必须是整数类型。

在 HTTP JSON 接口中，通过 script_fields 和 expressions 支持表达式。

{
    "table": "test",
    "query": {
        "match_all": {}
    }, "script_fields": {
        "add_all": {
            "script": {
                "inline": "( gid * 10 ) | crc32(title)"
            }
        },
        "title_len": {
            "script": {
                "inline": "crc32(title)"
            }
        }
    }
}

在这个示例中，创建了两个表达式：add_all 和 title_len。第一个表达式计算 ( gid * 10 ) | crc32(title) 并将结果存储在 add_all 属性中。第二个表达式计算 crc32(title) 并将结果存储在 title_len 属性中。

目前，仅支持 inline 表达式。inline 属性（要计算的表达式）的值具有与 SQL 表达式相同的语法。

表达式名称可以在过滤或排序中使用。

‹›

script_fields

script_fields

📋

{
    "table":"movies_rt",
    "script_fields":{
        "cond1":{
            "script":{
                "inline":"actor_2_facebook_likes =296 OR movie_facebook_likes =37000"
            }
        },
        "cond2":{
            "script":{
                "inline":"IF (IN (content_rating,'TV-PG','PG'),2, IF(IN(content_rating,'TV-14','PG-13'),1,0))"
            }
        }
    },
    "limit":10,
    "sort":[
        {
            "cond2":"desc"
        },
        {
            "actor_1_name":"asc"
        },
        {
            "actor_2_name":"desc"
        }
    ],
    "profile":true,
    "query":{
        "bool":{
            "must":[
                {
                    "match":{
                        "*":"star"
                    }
                },
                {
                    "equals":{
                        "cond1":1
                    }
                }
            ],
            "must_not":[
                {
                    "equals":{
                        "content_rating":"R"
                    }
                }
            ]
        }
    }
}

默认情况下，表达式值包含在结果集的 _source 数组中。如果源是选择性的（参见源选择），可以在请求中将表达式名称添加到 _source 参数中。注意，表达式的名称必须是小写的。

expressions 是 script_fields 的替代方案，语法更简单。示例请求添加了两个表达式并将结果存储在 add_all 和 title_len 属性中。注意，表达式的名称必须是小写的。

‹›

expressions

expressions

📋

{
  "table": "test",
  "query": { "match_all": {} },
  "expressions":
  {
      "add_all": "( gid * 10 ) | crc32(title)",
      "title_len": "crc32(title)"
  }
}

连接搜索选项

Last modified: August 28, 2025

搜索结果

SQL

HTTP

源选择

过滤器

WHERE

HTTP JSON

bool 查询

must

should

must_not

嵌套 bool 查询

SQL 格式的查询

各种过滤器

等值过滤器

集合过滤器

范围过滤器

地理距离过滤器

location_anchor

location_source

distance_type

距离

Joining tables

通用语法

SQL

JSON

连接类型

跨连接表的全文匹配

连接的JSON查询结构

理解连接操作中的查询行为

使用连接中的全文匹配的重要注意事项

示例：复杂的连接与分面

搜索选项与匹配权重

连接批处理

连接缓存

连接分布式表

注意事项与最佳实践

搜索中的表达式

算术运算符

比较运算符

布尔运算符

位运算符

函数：

HTTP JSON 中的表达式

script_fields

expressions