
NOTE: The integration with Vector.dev requires Manticore Buddy. If it does not work, make sure Buddy is installed.

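Vector by Datadog is an open-source observability data pipeline that collects, transforms, and routes logs and metrics. Vector can aggregate data on its own, but pairing it with Manticore gives you a dedicated storage and search layer.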

The following example shows how to forward Debian's dpkg.log through Vector.dev and index it in Manticore.

2023-05-31 10:42:55 status triggers-awaited ca-certificates-java:all 20190405ubuntu1.1
2023-05-31 10:42:55 trigproc libc-bin:amd64 2.31-0ubuntu9.9 <none>
2023-05-31 10:42:55 status half-configured libc-bin:amd64 2.31-0ubuntu9.9
2023-05-31 10:42:55 status installed libc-bin:amd64 2.31-0ubuntu9.9
2023-05-31 10:42:55 trigproc systemd:amd64 245.4-4ubuntu3.21 <none>

Create a vector.toml similar to the following:

[sources.test_file]
type = "file"
include = [ "/var/log/dpkg.log" ]
[transforms.modify_test_file]
type = "remap"
inputs = [ "test_file" ]
source = """
.vec_timestamp = del(.timestamp)"""
[sinks.manticore]
type = "elasticsearch"
inputs = [ "modify_test_file" ]
endpoints = ["http://127.0.0.1:9308"]
bulk.index = "dpkg_log"

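endpoints points to Manticore's HTTP interface (port 9308 by default). Adjust it if your instance listens elsewhere.
The remap transform moves Vector's default timestamp field to vec_timestamp, because timestamp is a reserved field in Manticore.
bulk.index names the table that is created automatically once Vector starts sending data.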
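
As a rough illustration of what this configuration does to each event, the Python sketch below mimics the remap step and builds the newline-delimited body that an Elasticsearch-style bulk endpoint accepts. The function names, the sample event, and the host value are hypothetical, and the real sink handles batching and retries that are left out here.

```python
import json


def remap(event: dict) -> dict:
    """Mimic the remap transform: move `timestamp` to `vec_timestamp`."""
    out = dict(event)
    if "timestamp" in out:
        out["vec_timestamp"] = out.pop("timestamp")
    return out


def to_bulk_ndjson(events: list, index: str) -> str:
    """Build an Elasticsearch-style bulk body: an action line, then the document."""
    lines = []
    for event in events:
        lines.append(json.dumps({"index": {"_index": index}}))
        lines.append(json.dumps(event))
    return "\n".join(lines) + "\n"


# Hypothetical event shaped like the documents shown in this section.
event = {
    "file": "/var/log/dpkg.log",
    "host": "example-host",
    "message": "2023-05-31 10:42:55 status installed libc-bin:amd64 2.31-0ubuntu9.9",
    "source_type": "file",
    "timestamp": "2023-08-04T15:32:50.203091741Z",
}

body = to_bulk_ndjson([remap(event)], index="dpkg_log")
print(body)
```

The document line carries vec_timestamp instead of timestamp, which matches the column list of the table that Manticore creates.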

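When started with this configuration, Vector.dev tails the log file, transforms each event, and forwards it directly to Manticore.

Save the configuration as vector.toml, then start the agent: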

vector --config vector.toml

If you run Vector.dev in Docker, mount the configuration file and the log file, for example:

docker run --rm -v /var/log/dpkg.log:/var/log/dpkg.log:ro \
  -v $(pwd)/vector.toml:/etc/vector/vector.toml:ro \
  timberio/vector:latest --config /etc/vector/vector.toml

Once the pipeline is running, Manticore creates the dpkg_log table automatically. Its schema and sample documents look like this:

mysql> DESCRIBE dpkg_log;
+-----------------+---------+--------------------+
| Field           | Type    | Properties         |
+-----------------+---------+--------------------+
| id              | bigint  |                    |
| file            | text    | indexed stored     |
| host            | text    | indexed stored     |
| message         | text    | indexed stored     |
| source_type     | text    | indexed stored     |
| vec_timestamp   | text    | indexed stored     |
+-----------------+---------+--------------------+
mysql> SELECT * FROM dpkg_log LIMIT 3\G
*************************** 1. row ***************************
id: 7856533729353672195
file: /var/log/dpkg.log
host: logstash-787f68f6f-nhdd2
message: 2023-06-05 14:03:04 startup archives install
source_type: file
vec_timestamp: 2023-08-04T15:32:50.203091741Z
*************************** 2. row ***************************
id: 7856533729353672196
file: /var/log/dpkg.log
host: logstash-787f68f6f-nhdd2
message: 2023-06-05 14:03:04 install base-passwd:amd64 <none> 3.5.47
source_type: file
vec_timestamp: 2023-08-04T15:32:50.203808861Z
*************************** 3. row ***************************
id: 7856533729353672197
file: /var/log/dpkg.log
host: logstash-787f68f6f-nhdd2
message: 2023-06-05 14:03:04 status half-installed base-passwd:amd64 3.5.47
source_type: file
vec_timestamp: 2023-08-04T15:32:50.203814031Z

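Combining Vector.dev with Manticore lets you collect logs from almost any source, enrich or clean them in transit, and store the results in a searchable database. The workflow keeps the observability pipeline simple while still allowing advanced transformations when you need them.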


Last modified: January 08, 2026

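Kibana is a visual interface for exploring, visualizing, and building dashboards from your log data. Integrating Kibana with Manticore Search can make Kibana visualizations load up to 3 times faster than with Elasticsearch, as shown in this demo. The integration lets users analyze their data through interactive dashboards, custom visualizations, and real-time search. It also simplifies handling diverse data sources by supporting tools such as Logstash and Filebeat for streamlined ingestion, which makes it a good fit for log analysis workflows.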

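Download Kibana: Make sure you download a Kibana version compatible with Manticore. Currently, version 7.6.0 is tested and recommended. Other 7.x versions may work but could cause issues. Versions 8.x are not supported.
Verify Manticore: Make sure your Manticore instance is running and its HTTP API is reachable (default: http://localhost:9308).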
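
One way to run the reachability check above is a small Python helper such as the following sketch. The function name is hypothetical, and it only confirms that something answers HTTP at the given URL; it does not verify that the responder is Manticore.

```python
import http.client
import urllib.error
import urllib.request


def http_api_reachable(url: str, timeout: float = 3.0) -> bool:
    """Return True if anything answers HTTP at `url`, whatever the status code."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, just not with a 2xx status.
        return True
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        # Connection refused, DNS failure, timeout, or a non-HTTP reply.
        return False
```

Calling http_api_reachable("http://localhost:9308") before starting Kibana gives a quick yes/no answer; adjust the URL if your instance listens elsewhere.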

Open the Kibana configuration file (kibana.yml).

Set the URL of your Manticore instance:

elasticsearch.hosts: ["http://localhost:9308"]

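Start Kibana and open http://localhost:5601 in your browser. Replace localhost with your server's IP address or hostname if necessary.

Note: Manticore does not require any authentication setup to work with Kibana. Also note that Manticore must run in real-time mode to integrate with Kibana.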


Example Manticore configuration:

searchd {
    listen = 127.0.0.1:9308:http
    pid_file = /var/run/manticore/searchd.pid
    data_dir = /var/lib/manticore
}

## Supported Features
### Discover
- Use the **Discover** tab in Kibana to search and filter your data interactively.
- Refine your searches using the query bar with simple queries in the [Kibana query language](https://www.elastic.co/guide/en/kibana/current/kuery-query.html).

### Visualizations
- Navigate to **Visualizations** to create custom visualizations:
  - Create a table pattern (it’s called an 'index pattern' in Kibana) if one doesn’t already exist to define your data source.
  - Choose a visualization type (e.g., bar chart, line chart, or pie chart).
  - Configure your visualization, execute it, and explore your data.
  - Save your visualizations for future use.

### Dashboards
- Access **Dashboards** to create or view interactive dashboards:
  - Add visualizations, filters, or controls for a personalized experience.
  - Interact with your data directly from the dashboard.
  - Save dashboards for future use.

### Management
- Go to **Management > Kibana** to customize settings like default time zones and visualization preferences.

## Limitations
- Currently, Kibana version 7.6.0 is tested and recommended. Other 7.x versions may work but could cause issues. Versions 8.x are not supported.
- The following Elasticsearch-specific field types are not supported:
  - [Spatial data types](https://www.elastic.co/guide/en/elasticsearch/reference/current/mapping-types.html#spatial_datatypes)
  - [Structured data types](https://www.elastic.co/guide/en/elasticsearch/reference/current/mapping-types.html#structured-data-types)
  - [Document ranking types](https://www.elastic.co/guide/en/elasticsearch/reference/current/mapping-types.html#document-ranking-types)
  - [Text search types](https://www.elastic.co/guide/en/elasticsearch/reference/current/mapping-types.html#text-search-types) (except for plain 'text')
  - [Relational data types](https://www.elastic.co/guide/en/elasticsearch/reference/current/mapping-types.html#object-types)
- Metric aggregation functions are limited to [those supported by Manticore](../Searching/Grouping.md#Aggregation-functions).
- The following Kibana tools are not supported:
  - [Canvas](https://www.elastic.co/guide/en/kibana/7.6/canvas.html) – A visualization and presentation tool for combining data with colors and images.
  - [Elastic Maps](https://www.elastic.co/guide/en/kibana/7.6/maps.html) – A tool for analyzing geographical data.
  - [Metrics](https://www.elastic.co/guide/en/kibana/7.6/xpack-infra.html) – An app for monitoring infrastructure metrics.
  - [Logs](https://www.elastic.co/guide/en/kibana/7.6/xpack-logs.html) – A console-like display for exploring logs from common services.
  - Monitoring:
    - [Uptime](https://www.elastic.co/guide/en/kibana/7.6/xpack-uptime.html) – Monitors the status of network endpoints via HTTP/S, TCP, and ICMP.
    - [APM (Application Performance Monitoring)](https://www.elastic.co/guide/en/kibana/7.6/xpack-apm.html) – Collects in-depth performance metrics from applications.
    - [SIEM (Security Information and Event Management)](https://www.elastic.co/guide/en/kibana/7.6/xpack-siem.html) – An interactive workspace for security teams to triage events and conduct initial investigations.
    - [ILM (Index lifecycle management)](https://www.elastic.co/guide/en/elasticsearch/reference/7.6/index-lifecycle-management.html) – Automatically manages indices according to performance, resiliency, and retention requirements.
    - [Stack Monitoring](https://www.elastic.co/guide/en/kibana/7.6/xpack-monitoring.html) – Provides visualizations of monitoring data across the Elastic Stack.
  - [Elasticsearch Management](https://www.elastic.co/guide/en/kibana/7.6/management.html) – A UI for managing Elastic Stack objects, including ILM (Index Lifecycle Management), etc.

## Data Ingestion and Exploration
Integrate Manticore with tools like [Logstash](../Integration/Logstash.md), [Filebeat](../Integration/Filebeat.md), [Fluentbit](https://manticoresearch.com/blog/integration-of-manticore-with-fluentbit/), or [Vector.dev](https://manticoresearch.com/blog/integration-of-manticore-with-vectordev/) to ingest data from sources like web logs. Once the data is loaded into Manticore, you can explore and visualize it in Kibana.


Last modified: November 07, 2025

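NOTE: This feature requires Manticore Buddy. If it does not work, make sure Buddy is installed.

Manticore supports real-time data ingestion from Apache Kafka through Kafka sources and materialized views, which enables real-time indexing and search. Currently, apache/kafka versions 3.7.0 through 4.1.0 are tested and supported.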

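To get started, follow these steps:

Define the source: Specify the Kafka topic that Manticore Search will read messages from. This setup includes the broker's host and port and the topic name.
Set up the destination table: Choose a Manticore real-time table to store the incoming Kafka data.
Create a materialized view: Set up a materialized view (mv) to handle data transformation and mapping from Kafka to the destination table in Manticore Search. Here you define field mappings, data transformations, and any filters or conditions for the incoming data stream.

The source configuration lets you define the broker, topic list, consumer group, and message structure.

Define the schema using Manticore field types such as int, float, text, json, etc.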

CREATE SOURCE <source name> [(column type, ...)] [source_options]

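All schema keys are case-insensitive: Products, products, and PrOdUcTs are treated as the same key. They are all converted to lowercase.

If your field names do not match the field name syntax allowed in Manticore Search (for example, they contain special characters or start with a digit), you must define a schema mapping. For instance, $keyName or 123field are valid keys in JSON but not valid field names in Manticore Search. If you try to use invalid field names without a proper mapping, Manticore returns an error and the source creation fails.

To handle such cases, use the following schema syntax to map invalid field names to valid ones: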

allowed_field_name 'original JSON key name with special symbols' type

For example:

price_field '$price' float    -- maps JSON key '$price' to field 'price_field'
field_123 '123field' text     -- maps JSON key '123field' to field 'field_123'
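
The mapping rule can be sketched in Python as a helper that emits a schema entry and adds an explicit mapping only when the JSON key is not usable as a field name. The validity pattern below (a letter or underscore followed by letters, digits, or underscores) is an assumption rather than the documented rule, and the way the replacement name is derived is purely illustrative.

```python
import re

# Assumption: a valid field name starts with a letter or underscore and then
# contains only letters, digits, or underscores. Check the manual for the exact rule.
VALID_FIELD_NAME = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def schema_entry(json_key: str, field_type: str) -> str:
    """Return a schema entry, adding an explicit mapping for invalid names."""
    if VALID_FIELD_NAME.fullmatch(json_key):
        return f"{json_key} {field_type}"
    # Hypothetical way to derive a safe name: drop invalid characters and
    # prefix names that would otherwise start with a digit.
    safe = re.sub(r"[^A-Za-z0-9_]", "", json_key) or "field"
    if safe[0].isdigit():
        safe = f"field_{safe}"
    return f"{safe} '{json_key}' {field_type}"


for key, field_type in [("term", "text"), ("$price", "float"), ("123field", "text")]:
    print(schema_entry(key, field_type))
```

Any valid name works on the left-hand side of a mapping; the derived names here are just one option.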

SQL:

CREATE SOURCE kafka
(id bigint, term text, abbrev '$abbrev' text, GlossDef json)
type='kafka'
broker_list='kafka:9092'
topic_list='my-data'
consumer_group='manticore'
num_consumers='2'
batch=50

Response:

Query OK, 2 rows affected (0.02 sec)

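| Option | Accepted values | Description |
| --- | --- | --- |
| `type` | `kafka` | Sets the source type. Currently, only `kafka` is supported. |
| `broker_list` | `host:port [, ...]` | Specifies the Kafka broker URLs. |
| `topic_list` | `string [, ...]` | Lists the Kafka topics to consume from. |
| `consumer_group` | `string` | Defines the Kafka consumer group. Defaults to `manticore`. |
| `num_consumers` | `int` | Number of consumers that process messages. |
| `partition_list` | `int [, ...]` | List of partitions to read from. |
| `batch` | `int` | Number of messages to process before moving on. Defaults to `100`; on timeout, the remaining messages are processed. |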

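The destination table is a regular real-time table that stores the results of Kafka message processing. Define it to match the schema of the incoming data and optimize it for the query performance your application needs. For more on creating real-time tables, see the corresponding section of the manual.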

SQL:

CREATE TABLE destination_kafka
(id bigint, name text, short_name text, received_at text, size multi);

Response:

Query OK, 0 rows affected (0.02 sec)

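A materialized view lets you transform data from Kafka messages. You can rename fields, apply Manticore Search functions, and perform sorting, grouping, and other data operations.

A materialized view acts as a query that moves data from the Kafka source to the destination table, and you customize that query with Manticore Search syntax. Make sure the fields in the view's SELECT list match the fields in the source.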

CREATE MATERIALIZED VIEW <materialized view name>
TO <destination table name> AS
SELECT [column|function [as <new name>], ...] FROM <source name>

SQL:

CREATE MATERIALIZED VIEW view_table
TO destination_kafka AS
SELECT id, term as name, abbrev as short_name,
       UTC_TIMESTAMP() as received_at, GlossDef.size as size FROM kafka

Response:

Query OK, 2 rows affected (0.02 sec)

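Data is transferred from Kafka to Manticore Search in batches, and the buffer is cleared after each run. Be careful with calculations that span batches, such as AVG: because processing happens per batch, they may not produce the result you expect.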
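
The batching caveat is easy to see with plain numbers. The Python sketch below uses hypothetical values and a hypothetical batch size of 2; it is not Manticore code, only the arithmetic behind the warning.

```python
from statistics import mean

values = [10, 20, 30, 40, 100]  # hypothetical numeric field from five messages
batch_size = 2

# Split the stream the way batch processing would see it.
batches = [values[i:i + batch_size] for i in range(0, len(values), batch_size)]
per_batch_avg = [mean(batch) for batch in batches]

print(per_batch_avg)  # one average per batch
print(mean(values))   # the average over the whole stream
```

Each batch yields its own average, and none of them has to equal the average over all messages, so an AVG inside the view describes a batch rather than the stream.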

Here is the mapping table based on the examples above:

| Kafka | Source | Buffer | MV | Destination |
| --- | --- | --- | --- | --- |
| `id` | `id` | `id` | `id` | `id` |
| `term` | `term` | `term` | `term as name` | `name` |
| `unnecessary_key` (which we're not interested in) | - | - | - | - |
| `$abbrev` | `abbrev` | `abbrev` | `abbrev as short_name` | `short_name` |
| - | - | - | `UTC_TIMESTAMP() as received_at` | `received_at` |
| `GlossDef` | `glossdef` | `glossdef` | `glossdef.size as size` | `size` |
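
To make the mapping concrete, the following Python sketch walks one hypothetical Kafka message through the same stages. It models the renaming only; the message contents, the helper names, and the received_at string format are assumptions made for illustration.

```python
import json
from datetime import datetime, timezone

# Hypothetical Kafka message; `unnecessary_key` is not declared in the source schema.
message = json.loads(
    '{"id": 1, "term": "Standard Generalized Markup Language", "$abbrev": "SGML",'
    ' "GlossDef": {"size": [17, 11]}, "unnecessary_key": "ignored"}'
)

# Source schema from the example: lowercase field name -> original JSON key.
SOURCE_SCHEMA = {"id": "id", "term": "term", "abbrev": "$abbrev", "glossdef": "GlossDef"}


def to_buffer_row(msg: dict) -> dict:
    """Keep only declared keys, renamed to their lowercase field names."""
    return {field: msg[key] for field, key in SOURCE_SCHEMA.items() if key in msg}


def apply_view(row: dict) -> dict:
    """Mirror the SELECT list of the materialized view."""
    return {
        "id": row["id"],
        "name": row["term"],
        "short_name": row["abbrev"],
        # Assumed format; stands in for UTC_TIMESTAMP().
        "received_at": datetime.now(timezone.utc).strftime("%Y-%m-%d %H:%M:%S"),
        "size": row["glossdef"]["size"],
    }


destination_row = apply_view(to_buffer_row(message))
print(destination_row)
```

The undeclared key never reaches the buffer, and received_at exists only from the view stage onward, which is what the dashes in the table express.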

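To view sources and materialized views in Manticore Search, use these commands:

SHOW SOURCES: Lists all configured sources.
SHOW MVS: Lists all materialized views.
SHOW MV view_table: Shows details of a specific materialized view.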

SQL:

SHOW SOURCES

Response:

+-------+
| name  |
+-------+
| kafka |
+-------+

SQL:

SHOW SOURCE kafka;

Response:

+--------+-------------------------------------------------------------------+
| Source | Create Table                                                      |
+--------+-------------------------------------------------------------------+
| kafka  | CREATE SOURCE kafka                                               |
|        | (id bigint, term text, abbrev '$abbrev' text, GlossDef json)      |
|        | type='kafka'                                                      |
|        | broker_list='kafka:9092'                                          |
|        | topic_list='my-data'                                              |
|        | consumer_group='manticore'                                        |
|        | num_consumers='2'                                                 |
|        | batch=50                                                          |
+--------+-------------------------------------------------------------------+

SQL:

SHOW MVS

Response:

+------------+
| name       |
+------------+
| view_table |
+------------+

SQL:

SHOW MV view_table

Response:

+------------+--------------------------------------------------------------------------------------------------------+-----------+
| View       | Create Table                                                                                           | suspended |
+------------+--------------------------------------------------------------------------------------------------------+-----------+
| view_table | CREATE MATERIALIZED VIEW view_table TO destination_kafka AS                                            | 0         |
|            | SELECT id, term as name, abbrev as short_name, UTC_TIMESTAMP() as received_at, GlossDef.size as size   |           |
|            | FROM kafka                                                                                             |           |
+------------+--------------------------------------------------------------------------------------------------------+-----------+

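You can suspend data consumption by altering the materialized view.

If you drop the source without dropping the materialized view, the view is suspended automatically. After you recreate the source, un-suspend the materialized view manually with the ALTER command.

Currently, only materialized views can be altered. To change source parameters, drop and recreate the source.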

SQL:

ALTER MATERIALIZED VIEW view_table suspended=1

Response:

Query OK (0.02 sec)
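
Resuming a view is presumably the mirror image of the statement above. The small Python helper below builds both variants; it assumes that suspended=0 un-suspends a view, which matches the suspended column shown by SHOW MV but is not spelled out in this section. The helper name is hypothetical.

```python
def alter_mv_suspended(view_name: str, suspended: bool) -> str:
    """Build the ALTER statement that suspends or resumes a materialized view."""
    # Assumption: suspended=0 resumes the view, mirroring suspended=1 above.
    return f"ALTER MATERIALIZED VIEW {view_name} suspended={int(suspended)}"


print(alter_mv_suspended("view_table", True))
print(alter_mv_suspended("view_table", False))
```

Run the resulting statement through any MySQL-protocol client connected to Manticore.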

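You can also specify a partition_list for each Kafka topic. One of the main benefits of this approach is the ability to shard your table through Kafka. To do so, create a separate source → materialized view → destination table chain for each shard:

Sources: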

CREATE SOURCE kafka_p1 (id bigint, term text)
  type='kafka' broker_list='kafka:9092' topic_list='my-data'
  consumer_group='manticore' num_consumers='1' partition_list='0' batch=50;
CREATE SOURCE kafka_p2 (id bigint, term text)
  type='kafka' broker_list='kafka:9092' topic_list='my-data'
  consumer_group='manticore' num_consumers='1' partition_list='1' batch=50;

Destination tables:

CREATE TABLE destination_shard_1 (id bigint, name text);
CREATE TABLE destination_shard_2 (id bigint, name text);

Materialized views:

CREATE MATERIALIZED VIEW mv_1 TO destination_shard_1 AS SELECT id, term AS name FROM kafka_p1;
CREATE MATERIALIZED VIEW mv_2 TO destination_shard_2 AS SELECT id, term AS name FROM kafka_p2;

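In this setup, rebalancing must be managed manually.
By default, Kafka does not distribute messages in a round-robin fashion.
To get round-robin-like distribution when sending data, make sure your Kafka producer is configured with:
- parse.key=true
- key.separator={your_delimiter}

Otherwise, Kafka distributes messages according to its internal rules, which can lead to uneven partitions.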
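
The effect of message keys on partition balance can be sketched in Python. The hash below (CRC32) is only a stand-in and is not the hash Kafka uses, the ":" separator is an arbitrary choice for key.separator, and both helper names are hypothetical; the point is just that many distinct keys spread out while a repeated key always lands on one partition.

```python
import zlib
from collections import Counter


def keyed_line(key: str, value: str, separator: str = ":") -> str:
    """Format one producer input line for use with parse.key=true."""
    return f"{key}{separator}{value}"


def simulated_partition(key: str, num_partitions: int) -> int:
    """Stand-in for key-based partitioning; Kafka itself uses a different hash."""
    return zlib.crc32(key.encode("utf-8")) % num_partitions


num_partitions = 2
# Many distinct keys tend to spread across partitions...
spread = Counter(simulated_partition(f"key-{i}", num_partitions) for i in range(1000))
# ...while a single repeated key always lands on the same partition.
skewed = Counter(simulated_partition("same-key", num_partitions) for _ in range(1000))

print(keyed_line("key-1", '{"id": 1, "term": "SGML"}'))
print(dict(spread), dict(skewed))
```

With one source per partition, as in the sharding example, a skewed key distribution translates directly into unevenly filled destination tables.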

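Kafka commits offsets after each batch or when processing times out. If the process stops unexpectedly during a materialized view query, you may see duplicate entries. To avoid this, include an id field in your schema so that Manticore Search can prevent duplicates in the table.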
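
Why an id helps can be shown with a conceptual Python model of a replayed batch. This is not how Manticore stores rows, and whether a colliding id is replaced or rejected is not covered here; the sketch only shows that a key gives the table something to match on.

```python
def replay_into(table: dict, batch: list) -> None:
    """Write rows keyed by id, so a replayed row does not create a second copy."""
    for row in batch:
        table[row["id"]] = row


batch = [{"id": 1, "name": "alpha"}, {"id": 2, "name": "beta"}]

with_id: dict = {}
replay_into(with_id, batch)
replay_into(with_id, batch)  # same batch delivered again after an interrupted commit

without_id: list = []
without_id.extend(batch)
without_id.extend(batch)     # no key to match on, so every row is stored twice

print(len(with_id), len(without_id))
```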

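Worker initialization: After you configure a source and a materialized view, Manticore Search sets up a dedicated worker to handle data ingestion from Kafka.
Message mapping: Messages are mapped according to the source's schema and converted into a structured format.
Batching: Messages are grouped into batches for efficient processing. The batch size can be adjusted to balance performance and latency.
Buffering: Mapped batches are stored in a buffer table for efficient bulk operations.
Materialized view processing: The view logic is applied to the data in the buffer table, performing any transformations or filtering.
Data transfer: The processed data is then transferred to the destination real-time table.
Cleanup: The buffer table is cleared after each batch so that it is ready for the next one.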
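
The steps above can be condensed into a toy Python loop. It is a simplified model under stated assumptions: a single worker, an in-memory list standing in for the buffer table, and a final flush standing in for the timeout. The function and variable names are hypothetical.

```python
def run_pipeline(messages, batch_size, view):
    """Process messages batch by batch: buffer -> view -> destination -> cleanup."""
    destination = []
    buffer = []
    for message in messages:
        buffer.append(message)               # mapped message goes to the buffer
        if len(buffer) == batch_size:
            destination.extend(view(row) for row in buffer)
            buffer.clear()                   # cleanup before the next batch
    if buffer:                               # stand-in for the timeout flush
        destination.extend(view(row) for row in buffer)
        buffer.clear()
    return destination


rows = run_pipeline(
    messages=[{"id": i, "term": f"term-{i}"} for i in range(5)],
    batch_size=2,
    view=lambda row: {"id": row["id"], "name": row["term"]},
)
print(rows)
```

Five messages with a batch size of 2 produce two full batches and one partial batch, and all five rows still reach the destination.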


Last modified: October 28, 2025

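NOTE: The integration with DBeaver requires Manticore Buddy. If it does not work, make sure Buddy is installed.

DBeaver is a SQL client application and database administration tool. For MySQL databases, it uses the JDBC application programming interface to interact with them through a JDBC driver.

Manticore lets you use DBeaver to work with data stored in Manticore tables as if it were stored in a MySQL database. Currently, version 25.2.0 is tested and recommended. Other versions may work but could introduce issues.

To start working with Manticore in DBeaver, follow these steps:

Choose the New database connection option in DBeaver's UI
Choose SQL -> MySQL as DBeaver's database driver
Set the Server host and Port options to the host and port of your Manticore instance (keep the database field empty)
Set root/<empty password> as the authentication credentials

Because Manticore is not fully compatible with MySQL, only part of DBeaver's functionality is available when working with Manticore.

You can:

View, create, delete, and rename tables
Add and drop table columns
Insert, delete, and update column data

You cannot:

Use database integrity check mechanisms (MyISAM will be the only storage engine available)
Use MySQL stored procedures, triggers, events, etc.
Manage database users
Set other database administration options

Some MySQL data types are not currently supported by Manticore and therefore cannot be used when creating a new table with DBeaver. In addition, some supported data types are converted to the closest Manticore type, and type precision is ignored in that conversion. Below is the list of supported MySQL data types and the Manticore types they map to: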

BIGINT UNSIGNED => bigint
BOOL => boolean
DATE, DATETIME, TIMESTAMP => timestamp
FLOAT => float
INT => int
INT UNSIGNED, SMALLINT UNSIGNED, TINYINT UNSIGNED, BIT => uint
JSON => json
TEXT, LONGTEXT, MEDIUMTEXT, TINYTEXT, BLOB, LONGBLOB, MEDIUMBLOB, TINYBLOB => text
VARCHAR, LONG VARCHAR, BINARY, CHAR, VARBINARY, LONG VARBINARY => string

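More details on Manticore data types are available in the data types section of the manual.

Manticore can handle the DATE, DATETIME, and TIMESTAMP data types, but this requires Manticore Buddy to be enabled. Otherwise, an attempt to operate on one of these types results in an error.

Note that the TIME type is not supported.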
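
The type list above can be expressed as a lookup. The Python sketch below copies the mapping from that list and mimics the stated behavior that precision is ignored; the function name and the choice to raise ValueError for unlisted types such as TIME are illustrative assumptions, not Manticore's actual error handling.

```python
import re

# Manticore type -> MySQL types that map to it, copied from the list above.
_GROUPS = {
    "bigint": ["BIGINT UNSIGNED"],
    "boolean": ["BOOL"],
    "timestamp": ["DATE", "DATETIME", "TIMESTAMP"],
    "float": ["FLOAT"],
    "int": ["INT"],
    "uint": ["INT UNSIGNED", "SMALLINT UNSIGNED", "TINYINT UNSIGNED", "BIT"],
    "json": ["JSON"],
    "text": ["TEXT", "LONGTEXT", "MEDIUMTEXT", "TINYTEXT",
             "BLOB", "LONGBLOB", "MEDIUMBLOB", "TINYBLOB"],
    "string": ["VARCHAR", "LONG VARCHAR", "BINARY", "CHAR",
               "VARBINARY", "LONG VARBINARY"],
}
MYSQL_TO_MANTICORE = {
    mysql: manticore for manticore, names in _GROUPS.items() for mysql in names
}


def to_manticore_type(mysql_type: str) -> str:
    """Map a MySQL column type to its Manticore type, ignoring precision."""
    # Drop precision such as (255) or (10,2), then normalize case and spacing.
    base = re.sub(r"\(.*?\)", "", mysql_type)
    base = " ".join(base.upper().split())
    if base not in MYSQL_TO_MANTICORE:
        raise ValueError(f"unsupported MySQL type: {mysql_type}")
    return MYSQL_TO_MANTICORE[base]


print(to_manticore_type("VARCHAR(255)"))
print(to_manticore_type("int(10) unsigned"))
```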

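The Preferences -> Connections -> Client identification option in DBeaver must not be turned off or overridden. To work correctly with DBeaver, Manticore needs to distinguish its requests from others. It does so using the client notification info that DBeaver sends in request headers. Disabling client notification breaks that detection and, with it, Manticore's correct behavior.
When you first try to update data in a table, you will see the No unique key pop-up message and be asked to define a custom unique key. When you get this message, perform the following steps:
- Choose the Custom Unique Key option
- Select only the id column in the column list
- Press Ok

After that, you will be able to safely update your data.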


Last modified: October 02, 2025
