Content statistics – Google Search Appliance Administrative API Developers Guide: Protocol User Manual
Page 40

Google Search Appliance: Administrative API Developer’s Guide: Protocol
40
<?xml version=’1.0’ encoding=’UTF-8’?>
<entry xmlns=’http://www.w3.org/2005/Atom’
xmlns:gsa=’http://schemas.google.com/gsa/2007’>
<id>http://gsa:8000/feeds/diagnostics/http%3A%2F%2Fexample.com%2Fdoc.html</id>
<updated>2009-03-26T05:41:43.724Z</updated>
<app:edited xmlns:app=’http://purl.org/atom/app#’>
2009-03-26T05:41:43.724Z
</app:edited>
<link rel=’self’ type=’application/atom+xml’
href=’http://gsa:8000/feeds/diagnostics/http%3A%2F%2Fexample.com%2Fdoc.html’/>
<link rel=’edit’ type=’application/atom+xml’
href=’http://gsa:8000/feeds/diagnostics/http%3A%2F%2Fexample.com%2Fdoc.html’/>
<gsa:content name=’entryID’>http://example.com/doc.html</gsa:content>
<gsa:content name=’backwardLinks’>0</gsa:content>
<gsa:content name=’forwardLinks’>0</gsa:content>
<gsa:content name=’isCached’>1</gsa:content>
<gsa:content name=’lastModifiedDate’>-1</gsa:content>
<gsa:content name=’collectionList’>Default,default_collection</gsa:content>
<gsa:content name=’date’>-1</gsa:content>
<gsa:content name=’currentlyInFlight’>0</gsa:content>
<gsa:content name=’contentSize’>641</gsa:content>
<gsa:content name=’contentType’>text/html</gsa:content>
<gsa:content name=’crawlFrequency’>normal</gsa:content>
<gsa:content name=’crawlHistory’>
1245977534
2
Unchanged.
1245955634
1
Crawled: New Document
1245951054
2
Unchanged.
</gsa:content>
<gsa:content name=’latestOnDisk’>1245977534</gsa:content>
</entry>
Content Statistics
Get content statistics for each kind of documents using the contentStatistics feed.
Common query parameters for all requests:
Content statistics entry properties:
Parameter
Description
collectionName
Name of the collection which you want to list. The default value is the last
used collection.
Property
Description
<Entry Name>
The content type of documents, such as plain/text.
avgSize
The average document size of this content type.
maxSize
The maximal document size of this content type.
minSize
The minimal document size of this content type.
numFiles
The file number of this content type.
totalSize
The total document size of this content type.