@vectorize-io/vectorize-client

    Interface WEBCRAWLERConfig

    Configuration for Web Crawler connector

    WEBCRAWLERConfig

    interface WEBCRAWLERConfig {
        allowedDomainsOpt?: string[];
        forbiddenPaths?: string[];
        maxDepth?: number;
        maxErrorCount?: number;
        maxUrls?: number;
        minTimeBetweenRequests?: number;
        reindexIntervalSeconds?: number;
    }
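As an illustration, a configuration object that satisfies this interface might look like the sketch below. The interface is redeclared locally so the snippet compiles on its own; in real code it would presumably be imported from the package. All values are hypothetical and chosen only for illustration; none are documented defaults.

```typescript
// Local copy of the interface shown above, so this sketch is self-contained.
interface WEBCRAWLERConfig {
    allowedDomainsOpt?: string[];
    forbiddenPaths?: string[];
    maxDepth?: number;
    maxErrorCount?: number;
    maxUrls?: number;
    minTimeBetweenRequests?: number;
    reindexIntervalSeconds?: number;
}

// Hypothetical values for illustration only; these are not documented defaults.
const crawlerConfig: WEBCRAWLERConfig = {
    allowedDomainsOpt: ["https://docs.example.com"], // extra URL prefixes to crawl
    forbiddenPaths: ["/admin"],                      // paths to exclude
    maxDepth: 3,
    maxUrls: 500,
    maxErrorCount: 10,
    minTimeBetweenRequests: 1000,   // milliseconds
    reindexIntervalSeconds: 86400,  // seconds (24 hours)
};

// Every property is optional, so an empty object also type-checks.
const emptyConfig: WEBCRAWLERConfig = {};
```

Because every property is optional, callers can set only the fields they care about and omit the rest.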

    Properties

    allowedDomainsOpt?: string[]

    Additional allowed URLs or URL prefixes. Add one or more; the crawler will read URLs that match these patterns in addition to the seed URL(s). Example: https://docs.example.com


    forbiddenPaths?: string[]

    Forbidden paths. Example: /admin


    maxDepth?: number

    Maximum crawl depth.


    maxErrorCount?: number

    Maximum error count.


    maxUrls?: number

    Maximum number of URLs to crawl.


    minTimeBetweenRequests?: number

    Throttle: minimum time between requests, in milliseconds.


    reindexIntervalSeconds?: number

    Reindex interval, in seconds.
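Note that the two time-related properties use different units: minTimeBetweenRequests is in milliseconds, while reindexIntervalSeconds is in seconds. A minimal sketch of a conversion helper follows. The helper name `timingFromHuman`, its parameters, and the `CrawlerTimingConfig` type are hypothetical and not part of the client library.

```typescript
// Only the two time-related fields of WEBCRAWLERConfig, redeclared locally
// so this sketch is self-contained.
interface CrawlerTimingConfig {
    minTimeBetweenRequests?: number; // milliseconds
    reindexIntervalSeconds?: number; // seconds
}

// Hypothetical helper (not part of the library): converts a request rate and
// an interval in hours into the units the two fields expect.
function timingFromHuman(
    maxRequestsPerSecond: number,
    reindexEveryHours: number,
): CrawlerTimingConfig {
    if (maxRequestsPerSecond <= 0 || reindexEveryHours <= 0) {
        throw new RangeError("both arguments must be positive");
    }
    return {
        // Round up so the gap between requests never allows more than the target rate.
        minTimeBetweenRequests: Math.ceil(1000 / maxRequestsPerSecond),
        reindexIntervalSeconds: Math.round(reindexEveryHours * 3600),
    };
}

// At most 2 requests per second, reindex once a day.
const timing = timingFromHuman(2, 24);
```

The resulting object can be spread into a larger configuration object alongside the other properties.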
