Crawler access rules, Updating a crawl schedule – Google Search Appliance Administrative API Developers Guide: Protocol User Manual

Page 18

Advertising
background image

Google Search Appliance: Administrative API Developer’s Guide: Protocol

18

The response is as follows:

<?xml version=’1.0’ encoding=’UTF-8’?>
<entry xmlns=’http://www.w3.org/2005/Atom’

xmlns:gsa=’http://schemas.google.com/gsa/2007’>
<id>http://gsa:8000/feeds/config/crawlSchedule</id>
<updated>2008-12-11T06:29:35.862Z</updated>
<link rel=’self’ type=’application/atom+xml’

href=’http://gsa:8000/feeds/config/crawlSchedule’/>

<link rel=’edit’ type=’application/atom+xml’

href=’http://gsa:8000/feeds/config/crawlSchedule’/>

<gsa:content name=’entryID’>crawlSchedule</gsa:content>
<gsa:content name=’isScheduledCrawl’>0</gsa:content>

</entry>

Updating a Crawl Schedule

To update the crawl schedule, send an authenticated PUT request to the following URL:

http://Search_Appliance:8000/feeds/config/crawlSchedule

The following example changes the crawl schedule:

<?xml version=’1.0’ encoding=’UTF-8’?>
<entry xmlns=’http://www.w3.org/2005/Atom’

xmlns:gsa=’http://schemas.google.com/gsa/2007’>
<gsa:content name=’entryID’>crawlSchedule</gsa:content>
<gsa:content name=’isScheduledCrawl’>1</gsa:content>
<gsa:content name=’crawlSchedule’>0,0300,360 2,0000,1200</gsa:content>

</entry>

The following example changes crawl mode to continuous crawl:

<?xml version=’1.0’ encoding=’UTF-8’?>
<entry xmlns=’http://www.w3.org/2005/Atom’

xmlns:gsa=’http://schemas.google.com/gsa/2007’>
<gsa:content name=’entryID’>crawlSchedule</gsa:content>
<gsa:content name=’isScheduledCrawl’>0</gsa:content>

</entry>

Crawler Access Rules

Create, retrieve, update, and delete crawler access rules on a search appliance.

Crawler access rules instruct the crawler how to authenticate when crawling protected content, as
shown in the following list of properties:

Property

Description

domain

Windows domain (for NTLM) or empty (for HTTP Basic authorization)

isPublic

Indicates whether users can get results on both the public content
(normally available to everyone) and the secure (confidential) content. The
value can be 1 or 0. For the search appliance, crawler access can let the
search appliance index secure content. If isPublic is 1, then the content
can be searched by anyone. If isPublic is 0, then content can only be
searched by users who can access the secure content.

Advertising