
Federated Search crawlers need to target an HTML element containing the content



Posted on August 4, 2022

Feature Request Summary: 

The configuration options for Federated Search crawlers should include a way to target the HTML element that contains the content to be indexed.

Description/Use Cases: 

I want the crawler to index only the main content of each external page, not the header, navigation, or other page chrome.

Business impact of limitation or missing feature:

Currently, the crawler appears to index only the first ten thousand characters of text it finds on each external page. In our case, the page header and navigation alone contain more than ten thousand characters, so the crawl produces 170 pages of exactly the same content.

This makes the crawler unusable for us, so we'll need to build our own integration using the API in order to use Federated Search.
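
To illustrate the kind of element targeting being requested, here is a minimal sketch of what an API-based workaround might do before submitting a page. It assumes the main content lives in a `<main>` element, which may not match every site's markup. The names `MainContentExtractor` and `extract_main_text` are hypothetical, and the 10,000-character limit is taken from the behavior observed above, not from documented crawler settings. Sending the result to the API is left out.

```python
from html.parser import HTMLParser


class MainContentExtractor(HTMLParser):
    """Collects text inside the first element with the target tag name."""

    def __init__(self, target_tag="main"):
        super().__init__()
        self.target_tag = target_tag  # assumption: content sits in <main>
        self.depth = 0                # > 0 while inside the target element
        self.skip = 0                 # > 0 while inside <script> or <style>
        self.done = False             # True once the first target has closed
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if self.done:
            return
        if tag == self.target_tag:
            self.depth += 1
        elif self.depth > 0 and tag in ("script", "style"):
            self.skip += 1

    def handle_endtag(self, tag):
        if self.done or self.depth == 0:
            return
        if tag == self.target_tag:
            self.depth -= 1
            if self.depth == 0:
                self.done = True
        elif tag in ("script", "style") and self.skip > 0:
            self.skip -= 1

    def handle_data(self, data):
        # Keep only visible text that sits inside the target element.
        if self.depth > 0 and self.skip == 0 and not self.done:
            self.chunks.append(data)


def extract_main_text(html, target_tag="main", limit=10_000):
    """Return whitespace-normalized text of the target element, truncated."""
    parser = MainContentExtractor(target_tag)
    parser.feed(html)
    parser.close()
    text = " ".join(" ".join(parser.chunks).split())
    return text[:limit]


if __name__ == "__main__":
    # A page whose navigation alone exceeds ten thousand characters.
    page = (
        "<html><body><nav>" + "Menu " * 3000 + "</nav>"
        "<main><h1>Title</h1><p>Body text.</p></main></body></html>"
    )
    print(extract_main_text(page))
```

With a setting like `target_tag` available in the crawler configuration, the oversized navigation would be skipped and each of the 170 pages would be indexed with its own content.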

