Content statistics – Google Search Appliance Administrative API Developers Guide: Protocol User Manual

Page 40

Advertising
background image

Google Search Appliance: Administrative API Developer’s Guide: Protocol

40

<?xml version=’1.0’ encoding=’UTF-8’?>
<entry xmlns=’http://www.w3.org/2005/Atom’

xmlns:gsa=’http://schemas.google.com/gsa/2007’>

<id>http://gsa:8000/feeds/diagnostics/http%3A%2F%2Fexample.com%2Fdoc.html</id>
<updated>2009-03-26T05:41:43.724Z</updated>
<app:edited xmlns:app=’http://purl.org/atom/app#’>

2009-03-26T05:41:43.724Z

</app:edited>
<link rel=’self’ type=’application/atom+xml’

href=’http://gsa:8000/feeds/diagnostics/http%3A%2F%2Fexample.com%2Fdoc.html’/>

<link rel=’edit’ type=’application/atom+xml’

href=’http://gsa:8000/feeds/diagnostics/http%3A%2F%2Fexample.com%2Fdoc.html’/>

<gsa:content name=’entryID’>http://example.com/doc.html</gsa:content>
<gsa:content name=’backwardLinks’>0</gsa:content>
<gsa:content name=’forwardLinks’>0</gsa:content>
<gsa:content name=’isCached’>1</gsa:content>
<gsa:content name=’lastModifiedDate’>-1</gsa:content>
<gsa:content name=’collectionList’>Default,default_collection</gsa:content>
<gsa:content name=’date’>-1</gsa:content>
<gsa:content name=’currentlyInFlight’>0</gsa:content>
<gsa:content name=’contentSize’>641</gsa:content>
<gsa:content name=’contentType’>text/html</gsa:content>
<gsa:content name=’crawlFrequency’>normal</gsa:content>
<gsa:content name=’crawlHistory’>
1245977534

2

Unchanged.

1245955634

1

Crawled: New Document

1245951054

2

Unchanged.

</gsa:content>
<gsa:content name=’latestOnDisk’>1245977534</gsa:content>
</entry>

Content Statistics

Get content statistics for each kind of documents using the contentStatistics feed.

Common query parameters for all requests:

Content statistics entry properties:

Parameter

Description

collectionName

Name of the collection which you want to list. The default value is the last
used collection.

Property

Description

<Entry Name>

The content type of documents, such as plain/text.

avgSize

The average document size of this content type.

maxSize

The maximal document size of this content type.

minSize

The minimal document size of this content type.

numFiles

The file number of this content type.

totalSize

The total document size of this content type.

Advertising