Crawler access rules, Updating a crawl schedule – Google Search Appliance Administrative API Developers Guide: Protocol User Manual
Page 18

Google Search Appliance: Administrative API Developer’s Guide: Protocol
18
The response is as follows:
<?xml version=’1.0’ encoding=’UTF-8’?>
<entry xmlns=’http://www.w3.org/2005/Atom’
xmlns:gsa=’http://schemas.google.com/gsa/2007’>
<id>http://gsa:8000/feeds/config/crawlSchedule</id>
<updated>2008-12-11T06:29:35.862Z</updated>
<link rel=’self’ type=’application/atom+xml’
href=’http://gsa:8000/feeds/config/crawlSchedule’/>
<link rel=’edit’ type=’application/atom+xml’
href=’http://gsa:8000/feeds/config/crawlSchedule’/>
<gsa:content name=’entryID’>crawlSchedule</gsa:content>
<gsa:content name=’isScheduledCrawl’>0</gsa:content>
</entry>
Updating a Crawl Schedule
To update the crawl schedule, send an authenticated PUT request to the following URL:
http://Search_Appliance:8000/feeds/config/crawlSchedule
The following example changes the crawl schedule:
<?xml version=’1.0’ encoding=’UTF-8’?>
<entry xmlns=’http://www.w3.org/2005/Atom’
xmlns:gsa=’http://schemas.google.com/gsa/2007’>
<gsa:content name=’entryID’>crawlSchedule</gsa:content>
<gsa:content name=’isScheduledCrawl’>1</gsa:content>
<gsa:content name=’crawlSchedule’>0,0300,360 2,0000,1200</gsa:content>
</entry>
The following example changes crawl mode to continuous crawl:
<?xml version=’1.0’ encoding=’UTF-8’?>
<entry xmlns=’http://www.w3.org/2005/Atom’
xmlns:gsa=’http://schemas.google.com/gsa/2007’>
<gsa:content name=’entryID’>crawlSchedule</gsa:content>
<gsa:content name=’isScheduledCrawl’>0</gsa:content>
</entry>
Crawler Access Rules
Create, retrieve, update, and delete crawler access rules on a search appliance.
Crawler access rules instruct the crawler how to authenticate when crawling protected content, as
shown in the following list of properties:
Property
Description
domain
Windows domain (for NTLM) or empty (for HTTP Basic authorization)
isPublic
Indicates whether users can get results on both the public content
(normally available to everyone) and the secure (confidential) content. The
value can be 1 or 0. For the search appliance, crawler access can let the
search appliance index secure content. If isPublic is 1, then the content
can be searched by anyone. If isPublic is 0, then content can only be
searched by users who can access the secure content.