Monitoring Source Processing Status - Symantec eDiscovery Platform

The Processing Status screen displays the status of the Indexer and Analytics services, statistics on the collected and indexed content, and the indexing progress for each document source in the case.

To view the processing status

1. On the top navigation bar, select a case, then click the Processing module. The Processing Status screen displays.

2. Under the Processing Status tab, the Batch Processing Status section displays the name of the processed batch and percent complete. Two status views are available:

A. The Overall Progress sub-tab provides a high-level view of each processing phase and the current status of the case. (If any problems occurred during the last processing cycle, click the icon next to the source for more details.)

The Status column indicates whether each service is running, has successfully completed, or has never been started. The Time Taken column shows how long each phase of processing took to complete.

B. The Appliance Progress sub-tab shows which appliance completed the processing job, and the current status of the appliance. (If you have more than one appliance provisioned for distributed processing, multiple appliances will be displayed.)

To view indexing detail, scroll down to the Indexing Progress section. This area lists each document source, the total number of items enabled for indexing, the number of individual files (such as loose files, PST files, and NSF files), the number of message files, and the percentage of the collected content that has been indexed.

To view or change a source’s configuration, click the source name (see “Managing Case Sources and Custodians” on page 44).

3. Click the Processing Statistics tab to view statistics of the case after processing.

Note: Move the mouse over a cell in any column to display more information about its content.

This view includes the following processing detail:

A. The Statistic column provides case statistics for the document crawler and indexing services. Choose a specific batch job or All Batches to view the results for all jobs. These statistics are cumulative, and are not reset when the appliance is rebooted.

B. The Messages column includes email documents that may be processed from an email server or archive or from an individual email file (.msg/.eml). It also includes non-email items such as contacts and calendar entries.

C. The Files column includes counts for any non-email documents, excluding email attachments.

D. The Total column combines the total numbers for each row in the previous columns for each statistic.
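As a rough illustration of how the Messages, Files, and Total columns relate for count-type statistics, the sketch below sums the two per-type counts into a total. The `StatisticRow` class and the numbers are hypothetical and are not part of the product; the sketch also assumes that Total is a plain sum, which would not apply to rows such as Average size or Deduplication %.

```python
from dataclasses import dataclass


@dataclass
class StatisticRow:
    """Hypothetical model of one count-type row on the Processing Statistics tab."""
    name: str
    messages: int  # value shown in the Messages column
    files: int     # value shown in the Files column

    @property
    def total(self) -> int:
        # Assumption: for count statistics, Total = Messages + Files.
        return self.messages + self.files


# Invented numbers, for illustration only.
row = StatisticRow("Documents selected for processing", messages=1200, files=300)
print(f"{row.name}: {row.total}")
```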

To view the remaining disk space on an appliance, refer to the section "Maintaining eDiscovery Appliances" in the System Administration Guide. The following table describes each of the statistics. Hover over any entries in the list to view additional information.

Note: To view file and message warnings, and unprocessed documents or mailboxes, see the Processing > Exceptions screen.

Processing Statistics

| Statistic | Messages | Files | Total |
|---|---|---|---|
| Total source files | | Total source files (including loose files and email containers such as PSTs and NSFs) when this batch was processed | Total source files (including loose files and email containers such as PSTs and NSFs) when this batch was processed |
| Total source documents | Total messages after extraction from email containers (PSTs/NSFs) | Total files after separating out emails and email containers as messages | Total documents found in source locations, including loose files and individual emails |
| Container files identified | | Loose files identified as containers | |
| Known files excluded | | Known files excluded due to known file filtering (also known as “de-NIST-ing”) | Known files excluded due to known file filtering (also known as “de-NIST-ing”) |
| Excluded due to processing options | Messages excluded from processing based on user-selected processing options | Loose files excluded from processing based on user-selected processing options | Total documents excluded due to user-selected processing options |
| Already processed | Messages that have already been processed in previous batches | Loose files that have already been processed in previous processing batches | Total documents already processed in previous processing batches |
| Documents selected for processing | Messages selected for processing | Loose files selected for processing | Total documents selected for processing |

Note: The count and volume of documents selected for processing are calculated after extraction. They will almost always differ from the count and volume of documents in the original source location (sometimes substantially), depending on the compression ratios of the PST, NSF, and container files.

| Statistic | Messages | Files | Total |
|---|---|---|---|
| Excluded during processing | Messages and non-email message documents (like contacts) excluded during processing | Loose files excluded during processing | Total documents excluded during processing |
| Documents accepted for processing | Messages accepted for processing (selected minus excluded) | Loose files accepted for processing (selected minus excluded) | Total documents accepted for processing (selected minus excluded) |
| Files extracted from containers | Messages processed that were extracted from archive containers (such as ZIP files) | Loose files processed that were extracted from archive containers (such as ZIP files) | Total documents processed that were extracted from archive containers (such as ZIP files) |
| Documents unable to process (errors) | Messages that could not be processed due to an error condition (for example, the crawler dropped due to lack of PKI certificates) | Loose files that could not be processed due to an error condition (container files with error during discovery); unprocessed documents due to case exceptions | Total documents that could not be processed due to an error condition |
| Documents processed | Messages processed by the product (prior to deduplication) | Loose files processed by the product (prior to deduplication) | Total documents processed by the product (prior to deduplication) |
| Average size | Average message size (including attachments) across all messages processed | Average loose file size across all loose files processed | Average document size across all documents processed |
| Unique documents indexed | Unique messages indexed (after deduplication) | Unique loose files indexed (after deduplication) | Total indexed documents (this will match the number of documents returned in an “empty search”) |
| Deduplication % | Percentage by which processed message count (volume) was reduced by deduplication | Percentage by which processed loose file count (volume) was reduced by deduplication | Percentage by which total processed document count (volume) was reduced by deduplication |

4. To export the information on the Processing Status screen, click Export.
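The arithmetic behind two of the statistics described above can be sketched in a few lines. The function names and sample numbers are hypothetical. The deduplication formula is an assumption based on the phrase "percentage by which processed count was reduced by deduplication"; the guide does not state the exact calculation the product uses, or whether it is based on count or volume.

```python
def accepted(selected: int, excluded: int) -> int:
    """Documents accepted for processing: selected minus excluded."""
    return selected - excluded


def dedup_percent(processed: int, unique: int) -> float:
    """Assumed formula: share of processed documents removed by deduplication."""
    if processed == 0:
        return 0.0  # nothing processed, so nothing was reduced
    return 100.0 * (processed - unique) / processed


# Invented numbers, for illustration only.
print(accepted(selected=1500, excluded=100))         # 1400
print(dedup_percent(processed=1400, unique=1050))    # 25.0
```

Under this reading, a Deduplication % of 25 would mean that one in four processed documents was a duplicate of another document already indexed.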
