EMC Corporation Corporate Headquarters: Hopkinton, MA 01748-9103
EMC
®
Kazeon-eDiscovery
Version 4.8.0
EMC believes the information in this publication is accurate as of its publication date. The information is subject to change without notice.
THE INFORMATION IN THIS PUBLICATION IS PROVIDED "AS IS." EMC CORPORATION MAKES NO REPRESENTATIONS OR WARRANTIES OF ANY KIND WITH RESPECT TO THE INFORMATION IN THIS PUBLICATION, AND SPECIFICALLY DISCLAIMS IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.
Use, copying, and distribution of any EMC software described in this publication requires an applicable software license. For the most up-to-date listing of EMC product names, see EMC Corporation Trademarks on EMC.com. Adobe and Adobe PDF Library are trademarks or registered trademarks of Adobe Systems Inc. in the U.S. and other countries. All other trademarks used herein are the property of their respective owners.
The IS1200 software is based in part on software licenses from the following: Outside In® Content Access © 1991-2015, Chicago, Inc.
Open Source code from www.java2s.com called the itext.asian.jar available at: http://www.java2s.com/Code/Jar/GHI/itext-asian.jar.htm
Copyright 2009 - 12 Demo Source and Support. All rights reserved In part on the work of the Independent JPEG Group.
Code from Inxight Software, Inc. Copyright © 1996-2015. All rights reserved. www.inxight.com. Certain icons used by the Kazeon Web applications come from the Silk Icon set
(http://www.famfamfam.com/lab/icons/silk/)
Figures... ...vii
Tables ... ...ix
Preface ... ...xi
Chapter 1
Introduction
About the IS1200 ... 2Extending Server Functionality ... 2
SourceOne Server Background ... 2
SourceOne Archive Connector Deployment Overview ... 3
SourceOne Archive Connector Deployment ...3
Chapter 2
Installing and Configuring the Connector
Installation Overview... 8Minimum Requirements for Connector Deployment ... 8
Configure the SourceOne Worker Server to Avoid DocumentManagement Web Service Deadlock ... 9
Creating IS1200 Access Credentials for SourceOne Servers ... 10
Creating IS1200 Access Credentials on the SourceOne Security Group ... 11
Obtaining SourceOne Server Network Addresses... 16
Installing the EMC SourceOne Archive Connector License ... 17
Adding the Access Credentials to the IS1200 Identity Vault ... 18
Chapter 3
Registering EMC SourceOne Repositories
About Registration ... 22
Web-Admin Registration ... 23
CLI Registration ... 25
Other Helpful CLI Commands... 26
Chapter 4
Workflow Changes for EMC SourceOne Archive
Connector
Limitations... 30MetaData Changes ... 32
Custodian Mapping Changes in Web-Admin ... 33
Security and Permissions... 34
Supported Jobs and Services... 35
Setting SourceOne Jobs and Action Scope ... 36
Selecting SourceOne Mapped Folders... 36
Selecting SourceOne Email, Files, or Sharepoint ... 37
Selecting SourceOne Scope by Time... 39
Selecting SourceOne Scope by Display Name ... 40
Selecting SourceOne Scope by Date... 41
Selecting SourceOne Owners, the Lookup Button ... 42
Selecting SourceOne Scope Using Keywords... 43
Appendix A
Configuring EMC SourceOne Servers
IS1200 - SourceOne Authentication Requirements... 46Use SSO for ExShortcut and Default (Anonymous) for Search and SearchWS Services ... 46
Use BOTH SSO and Default (Anonymous) Authentication on Separate Worker Sites ... 47
Appendix B
Troubleshooting
Handling SourceOne Archive Indexing Errors... 50Fixing Result Filter Timeouts... 52
Errors When Registering SourceOne Server Data Repositories ... 53
Reconciling Object Counts for SourceOne Repository Collections and Classifications ... 54
Title Page
1 The Web-Admin Licenses Page... 17
2 Add License Dialog Box ... 18
3 The Web-Admin Identity Vault Page... 18
4 Add Identity Dialog Box ... 19
5 The Web-Admin Add Repository Tab ... 23
Title Page
As part of an effort to improve its product lines, EMC periodically releases revisions of its software and hardware. Therefore, some functions described in this document may not be supported by all versions of the software or hardware currently in use.The product release notes provide the most up-to-date information on product features.
Contact your EMC technical support professional if a product does not function properly or does not function as described in this document.
Note: This document was accurate at publication time. Go to EMC Online Support (https://support.emc.com) to ensure that you are using the latest version of this document.
Audience This guide is intended for both administrators and users (legal users, litigation support, or others doing searches or reports) of the
EMC Kazeon - eDiscovery server (or the IS1200).
The guide covers the initial deployment and setup of the IS1200 EMC SourceOne Archive Connector and the subsequent workflow changes necessary for the IS1200 to work with the connector. The connector allows the IS1200 to register, classify, and do searches and reports on
EMC SourceOne servers.
Related Documentation
IS1200 Installation and Quickstart Guide
- describes installing and configuring the IS1200 server software.
IS1200 Web-Admin User and Configuration Guide
IS1200 Web-Search User Guide
- describes using Web-Search to perform basic and advanced searches.
IS1200 Web-Reports User Guide
- describes using Web-Reports to create and use basic and advanced reports.
IS1200 eDiscovery Case Manager Administrators and Supervisors Guide
- for legal representatives, a primer of all the web-based Interfaces above for performing eDiscovery.
IS1200 Command Line Interface Reference Guide
- describes the IS1200 Command Line Interface and all its commands. Follow these steps to download IS1200 document from the web: 1. Go to
https://support.emc.com
and click the SUPPORT BYPRODUCT option in the home page.
2. In the Find a Product field, enter Kazeon. From the product selection list, choose one of the sub-headers (such as Kazeon
ECS) and click the Find button.
3. Kazeon ECS window is displayed. Click the link for Documentation.
4. In the left-navigation menu, choose a version level to display the available documents.
Conventions used in this document
EMC uses the following conventions for special notices:
DANGER indicates a hazardous situation which, if not avoided, will result in death or serious injury.
WARNING indicates a hazardous situation which, if not avoided, could result in death or serious injury.
NOTICE is used to address practices not related to personal injury.
Note: A note presents information that is important, but not hazard-related. IMPORTANT
An important notice contains information essential to software or hardware operation.
Typographical conventions
EMC uses the following type style conventions in this document. Normal Used in running (nonprocedural) text for:
• Names of interface elements (such as names of windows, dialog boxes, buttons, fields, and menus) • Names of resources, attributes, pools, Boolean expressions, buttons, DQL statements, keywords,
clauses, environment variables, functions, utilities
• URLs, pathnames, filenames, directory names, computer names, filenames, links, groups, service keys, file systems, notifications
Bold Used in running (nonprocedural) text for:
• Names of commands, daemons, options, programs, processes, services, applications, utilities, kernels, notifications, system calls, man pages
Used in procedures for:
• Names of interface elements (such as names of windows, dialog boxes, buttons, fields, and menus) • What user specifically selects, clicks, presses, or types
Italic Used in all text (including procedures) for: • Full titles of publications referenced in text • Emphasis (for example a new term) • Variables
Courier Used for:
• System output, such as an error message or script
• URLs, complete paths, filenames, prompts, and syntax when shown outside of running text
Courier bold Used for:
• Specific user input (such as commands)
Courier italic Used in procedures for: • Variables on command line • User input variables
< > Angle brackets enclose parameter or variable values supplied by the user
[ ] Square brackets enclose optional values
Where to get help EMC support, product, and licensing information can be obtained as follows.
EMC product, and licensing information can be obtained as follows.
Product information —For documentation, release notes, software updates, or for information about EMC products, licensing, and service, go to the EMC Online Support at:
https://support.emc.com
Technical Support —Go to EMC Online Support and click Service Center. You will see several options for contacting EMC Technical Support. Note that to open a service request, you must have a valid support agreement. Contact your EMC sales representative for details about obtaining a valid support agreement or with questions about your account.
Documentation Feedback
Your suggestions help us continue to improve the accuracy,
organization, and overall quality of the user publications. Please send your comments or opinions on this document to:
[email protected] Revision History
{ } Braces indicate content that you must specify (that is, x or y or z)
... Ellipses indicate nonessential information omitted from the example
Table 1 Revision History Details Revision Date Description
September 2015 Updated the Deduplication section in “Glossary”.
Added information about support for SourceOne 7.2 version in Kazeon 4.8.
October 2014 Added information about support for SourceOne 7.1 SP2 version in Kazeon 4.8.
August 2014 Added information about support for SourceOne 7.1 SP1 version in Kazeon 4.8.
This chapter provides general background information, and an introduction, to the products and concepts the reader must understand to use this guide.
The guide is provided as a companion to the IS1200 Web-Admin
User and Configuration Guide which should be read first as it contains
most of the basic IS1200 server setup and maintenance information on which this guide builds.
Topics include:
◆ About the IS1200 ... 2
◆ Extending Server Functionality ... 2
◆ SourceOne Server Background ... 2
◆ SourceOne Archive Connector Deployment Overview ... 3
◆ SourceOne Archive Connector Deployment ... 3
About the IS1200
The EMC Kazeon - eDiscovery (IS1200) is an integrated hardware and software system that provides information management solutions enabling organizations to efficiently and cost effectively classify, manage, and retrieve data. It provides consistent information visibility and control across distributed files, integrates seamlessly with existing infrastructures, and scales to support billions of files for searching, reporting, backup search and recovery, and file migration and archiving. The search capabilities are especially designed to facilitate eDiscovery situations.
Extending Server Functionality
The IS1200’s standard functionality can be expanded with optional add-on modules like the EMC SourceOne Archive Connector.
The EMC SourceOne Archive Connector requires an additional license for the IS1200 and allows the IS1200, IS1200 federations, and the Remote Office Edition, to access and work with all the content on EMC SourceOne repositories. IS1200 Information Centers
(Federations) also support EMC SourceOne Archive Connector access.
SourceOne Server Background
Because many companies rely on email as the primary means of communications, both internally and with customers, they have massive repositories of email often with large file attachments. EMC SourceOne Email Management is a comprehensive,
policy-based system that automatically collects, organizes, indexes and retains messages and associated attachments and stores them in designated archives connected to shared storage.
SourceOne message archiving can provide significant mailbox storage space reductions and reduce backup and recovery time. SourceOne works for small, medium and large corporations, as well as companies with 100,000 inboxes.
archived by the SourceOne Email Management and helps with compliance and governance regulations.
Additionally, SourceOne offers “user directed archiving” allowing users to classify business-important records. Through a simple drag and drop in their inboxes, users can place messages into pre-defined archives with set retention periods. EMC SourceOne also supports file archiving and SharePoint archiving.
SourceOne Archive Connector Deployment Overview
If the EMC SourceOne Archive Connector is ordered with the original IS1200 appliance purchase, the appropriate optional module license is automatically included in the IS1200 basic license and is routinely installed along with the server license.
If the EMC SourceOne Archive Connector is purchased after the original installation, the EMC SourceOne Archive Connector license key must be added to the installation before the EMC SourceOne Archive Connector can be configured or accessed. See the Installing License Keys chapter of the IS1200 Web-Admin User and Configuration Guide for details on obtaining and installing optional modules licenses for existing installations.
SourceOne Archive Connector Deployment
The EMC SourceOne Archive Connector software is automatically installed with all IS1200 installations, the EMC SourceOne Archive
Connector license key simply activates it.
To use EMC SourceOne Archive Connector purchased with a new installation, simply follow the standard hardware and software installation and configuration instructions in the Kazeon
IS1200 Installation and Quickstart Guide and then use the rest of this
guide to configure the EMC SourceOne Archive Connector.
1. On each EMC SourceOne server that will be accessed, create a set of user credentials for the IS1200 to use when accessing those servers. See “Creating IS1200 Access Credentials for SourceOne Servers” on page 10 for details.
2. Obtain registration information for all EMC SourceOne servers, see “Obtaining SourceOne Server Network Addresses” on page 16 for details.
3. Use Web-Admin to add the EMC SourceOne Archive Connector license key to the IS1200, then quit and relaunch the Web-Admin. See the Installing License Keys chapter of the IS1200 Web-Admin
User and Configuration Guide for details on installing optional
modules licenses for existing installations.
4. Add the SourceOne access credentials to the IS1200 Identity Vault, see “Adding the Access Credentials to the IS1200 Identity Vault” on page 18 for details.
5. Ensure an appropriate Active Directory authentication server is registered with the IS1200, see “Ensuring IS1200 AD
Authentication Server Works with Lookup Button” on page 19 for details.
6. Register the SourceOne repositories you want to work with. See “Registering EMC SourceOne Repositories” on page 21 for details. Once registered, SourceOne repositories can be classified and collected from.
7. Create any custom classification rules needed to extract situation-specific or case-specific metadata. See the Policies:
Classification Extraction and Assignment Rules chapter of the IS1200 Web-Admin User and Configuration Guide for details.
8. SourceOne repositories may be classified directly, or used as collection sources to traditional CIFS or NFS target repositories. The targets may then be processed with all standard IS1200 features.
• If the SourceOne is to be used as a collection source,
start a targeted collection against one or more mapped folders inside the SourceOne repositories and collect to a standard CIFS or NFS target repository. The following standard
SourceOne criteria may be used to limit the collection (at least one is mandatory):
– Mail Received Date Range
– SourceOne Display Name (For Email Only)
– File Accessed, Modified, and Created Date Range (for F iles and Sharepoint only)
– Owners
– One or more Keywords
Schedule a one-time, or recurring, deep classification against the target repositories of the collection above.
• Alternately, use in-place processing on the registered SourceOne repositories with specific filters to classify just a subset of the SourceOne repositories. The following criteria may be used to limit what is classified. At least one filter is mandatory:
– Select one or more options from Email, Files, or Sharepoint – Mapped Folders
– Mail Received Date Range
– SourceOne Display Name (For Email Only)
– File Accessed, Modified, and Created Date Range (for F iles and Sharepoint only)
– Owners (This is Mandatory) – One or more Keywords
9. Monitor classification job results to ensure all objects are successfully indexed, and review errors and exceptions.
10. Login to Web-Search to search for objects stored in the registered SourceOne repositories. View search result documents in native or non-native formats, or apply Actionable Services (such as Copy, Export, Download, Delete Metadata Only, ReClassify, or Tagging).
11. Login to Web-Reports to create reports about objects stored on registered SourceOne repositories. View report result documents in native or non-native formats, or apply Actionable Services (such as Copy, Export, Download, Delete Metadata Only, ReClassify, or Tagging).
This chapter describes how to install the EMC SourceOne Archive
Connector.
Topics include:
◆ Installation Overview ... 8
◆ Minimum Requirements for Connector Deployment ... 8
◆ Configure the SourceOne Worker Server to Avoid DocumentManagement Web Service Deadlock ... 9
◆ Creating IS1200 Access Credentials for SourceOne Servers ... 10
◆ Creating IS1200 Access Credentials on the SourceOne Security Group ... 11
◆ Obtaining SourceOne Server Network Addresses... 16
◆ Installing the EMC SourceOne Archive Connector License ... 17
◆ Adding the Access Credentials to the IS1200 Identity Vault ... 18
◆ Ensuring IS1200 AD Authentication Server Works with Lookup Button... 19
◆ Registering Metadata Repositories for SourceOne ... 20
Installation Overview
Installing and configuring the IS1200 for the EMC SourceOne Archive
Connector requires the following basic steps:
◆ Ensure the EMC SourceOne server meets the minimum requirements to support the EMC SourceOne Archive Connector.
◆ Configure the EMC SourceOne worker server to avoid potential deadlock of the DocumentManagement web service.
◆ Obtain existing, or create new, access credentials to the EMC SourceOne server that allow the IS1200 to connect and work with all EMC SourceOne servers that will be registered as data repositories. An existing primary service account may be used, however, if this is not permitted, new credentials must be created.
◆ Obtain the network addressing information for each EMC SourceOne server.
◆ Install the EMC SourceOne Archive Connector license on the IS1200.
◆ Add the IS1200 EMC SourceOne access credentials to the IS1200 Identity Vault.
◆ Ensure the Active Directory authentication server registered with IS1200 is one that contains the usernames and email addresses of the users whose email is archived on the EMC SourceOne Email Management Archive Server. This allows the Lookup button (found in IS1200 dialogs) to find registered mailbox owners (custodians) when setting up jobs such as collections from EMC SourceOne Email Management repositories.
◆ Create and register metadata repositories for all EMC SourceOne repositories to be registered.
Each basic step above is explained in detail below.
Minimum Requirements for Connector Deployment
Before the EMC SourceOne Archive Connector can be deployed, you must ensure that all EMC SourceOne servers to be registered meet the following minimum requirements:
◆ If the Worker Server is running IIS 7 then ensure that IIS 6 Management compatibility is also installed.
◆ All EMC SourceOne servers must be running any of the following supported versions: • 6.6 • 6.7 • 6.8 • 7.1 SP1 • 7.1 SP2 • 7.2
◆ The IS1200 must be running version 4.4.0 or later.
◆ All mapped folders on the EMC SourceOne servers must be fully indexed. This is required to use the “keywords” filters as
described in “Selecting SourceOne Scope Using Keywords” on page 43.
◆ All EMC SourceOne servers must have sufficient bandwidth to accommodate the increased load the EMC SourceOne Archive
Connector will place on it.
Configure the SourceOne Worker Server to Avoid
DocumentManagement Web Service Deadlock
On the EMC SourceOne worker server, edit the file:
c:\windows\Microsoft.NET\Framework\v2.0.50727\CONFIG\mac hine.config
Remove the settings: allowDefinition="MachineOnly" from the "ProcessModel" section, by doing the following:
From: ... <sectionGroup name="system.web"
type="System.Web.Configuration.SystemWebSectionGroup, System.Web, Version=2.0.0.0, Culture=neutral, PublicKeyToken=b03f5f7f11d50a3a"> ... <section name="processModel" type="System.Web.Configuration.ProcessModelSection,
System.Web, Version=2.0.0.0, Culture=neutral,
PublicKeyToken=b03f5f7f11d50a3a" allowDefinition="MachineOnly"
allowLocation="false" /> To:
... <sectionGroup name="system.web"
type="System.Web.Configuration.SystemWebSectionGroup, System.Web, Version=2.0.0.0, Culture=neutral, PublicKeyToken=b03f5f7f11d50a3a"> ... <section name="processModel" type="System.Web.Configuration.ProcessModelSection,
System.Web, Version=2.0.0.0, Culture=neutral,
PublicKeyToken=b03f5f7f11d50a3a" allowLocation="false" />
Note: The red text allowDefinition="MachineOnly"‘is missing from the “To” example.
Refer to:
http://msdn.microsoft.com/en-us/library/aa309408%28VS.71%29.aspx for more details about this setting.
2. Once the changes are made, restart IIS by doing the following: • From the Start menu, click Run.
• In the Open box, type cmd, and then click OK.
• At the command prompt, type the following: iisreset See the following Microsoft article for more details:
http://www.microsoft.com/technet/prodtechnol/WindowsServer200 3/Library/IIS/95826e7a-bac4-4e1f-bcb6-c52d49c9d7f4.mspx?mfr=true
Creating IS1200 Access Credentials for SourceOne Servers
The preferred account is the primary service account, or a similar account that is part of the EMC SourceOne Security group. The account must be registered on the Active Directory server that authenticates for all the EMC SourceOne servers that will be
registered on the IS1200 as data repositories. This account must have access to all mapped folders on the EMC SourceOne server and be able to search for and download any of the emails and non-email objects in these folders.
If the EMC SourceOne administrator is not allowed to provide the primary service account, then you must create an IS1200 credentials account as
described in the section immediately below, “Creating IS1200 Access Credentials on the SourceOne Security Group”.
After creating the account, it must be stored on the IS1200 as an identity, see “Adding the Access Credentials to the IS1200 Identity Vault” on page 18 for details.
Note: Identity usernames and passwords for EMC SourceOne repositories may contain only US-ASCII characters.
Creating IS1200 Access Credentials on the SourceOne Security Group
This section should only be used if the primary service account cannot be used as the IS1200 identity to access the EMC SourceOne servers. In this
1. Start by setting up a new user account on the Active Directory (AD) server used by the EMC SourceOne servers you intend to register as data repositories.
For example add the account: is1200idaccount
2. Add this account to the EMC SourceOne server security group on the AD server.
Generally, this means ensuring the new account is part of the
SourceOne Security Group and that group has been granted all
permissions for all Mapped Folders, as shown below for one folder.
4. Then open the Application Configuration menu (left pane below) and ensure the SourceOne Security Group is entered in the Security
group name: field found in the Retrieval tab as shown below.
5. Enable Basic Authentication on the EMC SourceOne Worker server. If the worker server host is Windows 2003, do the following:
a. Open Internet Information Services (IIS), select ExDocMgmtSvc in the left pane,
and right-click it to open it’s Properties dialog as shown below.
b. In the ExDocMgmtSvc Properties dialog, select the
Directory Security tab and click the Edit button in the
Authentication and access control section.
Generally ensure the dialog options are as shown above, specifically ensure the Basic Authentication checkbox is set as shown above and close the dialogs.
If the worker server host is Windows 2008, do the following: a. Open Internet Information Services (IIS),
select ExDocMgmtSvc in the left pane,.
b. Double-click Authentication under IIS in the middle pane. c. When the Authentication pane opens,
6. With both 2003 and 2008,
stop and re-start the ExDocMgmtSvc Application pool. 7. Verify the credentials work.
• Login to the EMC SourceOne server Web Search application with the new user account
• Select Search Type as 'Administrator'
• Find messages from one or more mapped folders
• Double click a message and verify that it can be downloaded
Obtaining SourceOne Server Network Addresses
To register a EMC SourceOne repository, the IS1200 requires specific network configuration information about the EMC SourceOne server. Ask a EMC SourceOne administrator to provide the following information about each EMC SourceOne server that will be registered.
◆ A repository reference name (what IS1200 users will see it referred to as in menus where the EMC SourceOne server may be selected as a source repository)
◆ The IP Address or HostName of the EMC SourceOne Search Web service
◆ The port number of the EMC SourceOne Search Web service
◆ The port number of the EMC SourceOne Document Management Service
Installing the EMC SourceOne Archive Connector License
If you purchased the EMC SourceOne Archive Connector with your original IS1200 appliance, the license key has already been installed. If you purchased the EMC SourceOne Archive Connectorlater, do the following to install the license key:
1. Locate the EMC SourceOne Archive Connector license key number and copy it to the system clipboard. It is most likely in the body of the licensing email that came when the module was purchased.
2. Login to the IS1200 Web-Admin application.
3. Under the Administration heading in the left navigation pane, click Licensing, the Licenses page opens.
4. Click Add License in the toolbar, the Add License dialog box appears
Figure 2 Add License Dialog Box
Paste the license key from the clipboard into the empty field, and click Add.
5. Click Refresh in the Licenses page toolbar to confirm the installation.
For complete details on installing and managing IS1200 license keys, see the IS1200 Web-Admin User and Configuration Guide chapter on
Administration: Backups, Licenses, and Healthinfo.
Adding the Access Credentials to the IS1200 Identity Vault
Once you have the username and password obtained from “Creating IS1200 Access Credentials on the SourceOne Security Group” on page 11, do the following to add them to the IS1200 Identity Vault. They are needed to register the EMC SourceOne repositories later. 1. Login to the IS1200 Web-Search application.
2. From the Web-Search left navigation pane under Authentication, click Identity Vault, the Identity Vault page appears.
3. Click Add in the toolbar, the Add Identity dialog opens.
Figure 4 Add Identity Dialog Box
4. Enter values for the following fields:
Identity Name. Enter a name to identify the EMC SourceOne credentials, maximum of 24 alphanumeric characters, NO spaces.
User Name. Enter the EMC SourceOne credentials username Note: Identity usernames and passwords for EMC SourceOne repositories may contain only US-ASCII characters.
Password. Enter the EMC SourceOne credentials password, and enter it again in the confirm field.
Domain. Enter the domain name where the username can be authenticated.
5. Click Submit to add the identity to the identity vault. For complete details on using and managing identities, see the
Identity Vault chapter of the IS1200 Web-Admin User and Configuration Guide.
Ensuring IS1200 AD Authentication Server Works with Lookup Button
See the Configuring External Authentication chapter of the
IS1200 Web-Admin User and Configuration Guide for details on
configuring AD authentication.
Registering Metadata Repositories for SourceOne
Create and register metadata repositories for EMC SourceOne server you intend to register as a data repository. Standard metadata repository sizing requirements apply.
See the Repository Registration and Management chapter of the
IS1200 Web-Admin User and Configuration Guide for complete details
This chapter discusses how to register EMC SourceOne servers as IS1200 data repositories, allowing them to be classified, searched, and reported on.
Topics include:
◆ About Registration... 22
◆ Web-Admin Registration ... 23
◆ CLI Registration ... 25
◆ Other Helpful CLI Commands ... 26
About Registration
After completing the installation and configuration described in the previous chapter, the EMC SourceOne servers must be registered as data repositories with the IS1200. Registration is required before a EMC SourceOne server may be classified, searched, or otherwise worked with by the IS1200.
The EMC SourceOne Archive Connector requires the IS1200 to be running version 4.4 or later and the EMC servers to be running any of the supported EMC SourceOne versions 6.6, 6.7, 6.8, 7.1 SP1, or 7.1 SP2.
EMC SourceOne servers may be registered as data repositories from both the GUI, and the Command Line Interface (CLI).
Web-Admin Registration
To register a EMC SourceOne server as an IS1200 data repository using Web-Admin, do the following:
1. Login to Web-Admin.
2. From the Web-Admin navigation pane under Repositories, click
SourceOne Archive, the Add Repository tab appears.
Figure 5 The Web-Admin Add Repository Tab
3. If it is not already selected, select SourceOne Archive from the Repository Type drop-down menu.
4. Fill in the following fields:
Name: Enter a reference name for this repository. The IS1200
uses reference names, instead of the full repository filepaths, in all
menus where a user must choose a repository, for example
when choosing a data repository for a classification or
four-byte UTF-8 encoded characters). Reference names may
include some special characters, see
“Special Characters” on
page 73
for details.
When seen in a menu, the reference name will be expandable
to show all the mapped folders it contains, allowing one or
more mapped folders to be selected.
Metadata Repository Select a metadata repository from the drop-down menu to associate with the EMC SourceOne server, be sure to select one of the metadata repositories created specifically for your EMC SourceOne repositories, and to assign a metadata repository only once.
SourceOne Options:
Use this section to enter the information described in “Obtaining SourceOne Server Network Addresses” on page 16.
Document Management Service Parameters
IP Address/Server Name. Enter the IP address, or hostname, of the EMC SourceOne server’s Document Management service.
Port. Enter the port number of the EMC SourceOne server’s Document Management Service.
Search Web Service Parameters
IP Address/Server Name. Enter the IP address, or hostname, of the EMC SourceOne server’s Web service.
Port. Enter the port number of the EMC SourceOne server’s Search Web service.
Connection Credentials
Identity. From the drop-down menu, select the identity created to hold the access credentials needed for the IS1200 to access the EMC SourceOne server. See “Creating IS1200 Access Credentials for SourceOne Servers” on page 10 and “Adding the Access Credentials to the IS1200 Identity Vault” on page 18 for more information.
Note: Identity usernames and passwords for EMC SourceOne repositories may contain only US-ASCII characters.
CLI Registration
To register a EMC SourceOne server as an IS1200 data repository from the (CLI), login to the CLI as admin and issue the following command:
Note: This method is recommended only for system administrators already familiar with the CLI.
add SourceOneArchive <referenceName> kazfs
<metadataRepName> as <identity> search-service-host <search-server_ip-host> search-service-port
<search-server_port#> document-service-host <document-server_ip-host> document-service-port <document-server_port#>
Where the <bracketed text> is to be supplied by the user as follows:
<referenceName> A unique name to assign to the server for the IS1200 to use when displaying the server as a potential source repository in job setup screens, for example in a deep classification setup. This is usually the “common” name that users know the server by.
<metadataRepName> The name of the metadata repository for the EMC SourceOne servers (if not specified, the IS1200 chooses an available metadata repository).
<identity> The name of the identity stored in the IS1200 Identity Vault containing the access credentials the IS1200 needs to access this EMC SourceOne server. See “Creating IS1200 Access Credentials for SourceOne Servers” on page 10 and “Adding the Access Credentials to the IS1200 Identity Vault” on page 18 for more information <search-server_ip-host> Host name or IP address of the SourceOne’s search web service.
<search-server_port#> (Optional) Port number of the SourceOne’s search web service [ default 80 ]
<document-server_ip-host> Host name or IP address of the SourceOne’s document web service.
Other Helpful CLI Commands
Once a EMC SourceOne server is registered as a data repository, the following Command Line Interface commands are helpful in managing it. For complete descriptions of the syntax and usage of these
commands, see the IS1200 Command Line Interface Reference Guide.
SET FS <fsname> <state>
The set fs command may be used to toggle any repository on or off line.
<fsname> The IS1200 reference name assigned to the repository to affect.
<state> The desired state for the repository.
REMOVE FS <fsname>
The remove fs command is used to delete, or un-register, a repository. The repository must be off-line first.
<fsname> The IS1200 reference name assigned to the repository to affect.
SHOW FS parameter
Display the configuration and parameters of registered repositories. Follow show fs with any of the following parameters:
detail Enter an information type to display (maps|stats|all|metadata)
type Enter a repository type to filter by (kaz|data|localkaz|exchange|ev)
fs-name Enter a specific repository name.
status Enter to show health statistics for registered repositories.
SHOW REPOSITORY <parameter)
Display the configuration and parameters of registered repositories. Follow show fs with any of the following parameters:
repository name Name of the repository to show
This chapter discusses the workflow differences, compared to standard IS1200 operation, required by repositories accessed by the IS1200 EMC SourceOne Archive Connector.
Topics include:
◆ Limitations ... 30
◆ MetaData Changes... 32
◆ Custodian Mapping Changes in Web-Admin ... 33
◆ Security and Permissions... 34
◆ Supported Jobs and Services ... 35
◆ Setting SourceOne Jobs and Action Scope ... 36
◆ Selecting SourceOne Mapped Folders ... 36
◆ Selecting SourceOne Email, Files, or Sharepoint... 37
◆ Selecting SourceOne Scope by Time... 39
◆ Selecting SourceOne Scope by Display Name ... 40
◆ Selecting SourceOne Scope by Date ... 41
◆ Selecting SourceOne Owners, the Lookup Button... 42
◆ Selecting SourceOne Scope Using Keywords... 43
Once the EMC SourceOne Archive Connector is installed and configured, and all the EMC SourceOne archives are registered as data repositories, the IS1200 can begin standard operations and provide standard services for all registered EMC SourceOne Archive repositories. This chapter details the differences in features,
workflow, and operations between using registered SourceOne repositories and using standard NFS or CIFS data repositories.
Limitations
Not all standard jobs and services are supported with registered EMC SourceOne repositories. The following exceptions exist:
◆ Only four EMC SourceOne repositories may be registered with an IS1200 cluster.
◆ EMC SourceOne repositories are “source” only, the IS1200 cannot write or delete message files on them. Consequently, the IS1200 cannot maintain access times during crawls, cannot enforce legal hold on them, nor perform collections, copies, legal exports, or downloads with EMC SourceOne repositories as targets.
◆ EMC SourceOne repositories may not be “discovered” using the
Web-Admin Environment Discovery tab.
◆ EMC SourceOne repositories only allows deep classifications.
◆ The IS1200 cannot enforce legal hold or retention on the EMC SourceOne file system. It can record legal hold in the associated metadata repositories.
◆ In Web-Admin, the repository health check function has limited scope and can only report whether the EMC SourceOne servers are accessible, or not. The IS1200 can pause and resume jobs based on this status.
◆ Access Control Lists (ACLs) information and file extensions of the file system objects and Sharepoint documents cannot be found in the registered EMC SourceOne archive repositories. EMC SourceOne for file systems does not store the file systems permissions and file extensions during the file archiving activity.
◆ In the Web-Admin Deep Crawl setup screen, the “Remove Deleted Documents from Kazeon Index” checkbox is not supported for SourceOnce repositories.
Do a Metadata Synchronization Service instead.
◆ Objects (e-mails, files, Sharepoint documents, etc.) larger than 100 Mbytes can not be downloaded from EMC SourceOne
repositories.
◆ EMC SourceOne does not provide the proper html tags for wiki, issues, and Sharepoint item objects in the Sharepoint archive. For these objects, IS1200 shows the Document type as Unknown.
◆ EMC SourceOne does not save owner information for files during file archiving. IS1200 captures the archived file access
information to represent the custodian of the files. In some cases, EMC SourceOne does not save the access permission details. For these objects, the system shows custodian information as
Unassigned.
MetaData Changes
The following metadata fields are new, or changed for EMC SourceOne servers.
Indexed fields are searchable.
Table 2 Metadata Mapping for EMC SourceOne Archive Repositories
Kazeon Metadata Fields Description Indexed?
SourceOneArchiveOwner SourceOne object Owners (Owner name is stored, not e-mail address or legacy DN)
Yes
SourceOneArchiveFolderName Name of mapped folder where the SourceOne object was found Yes SourceOneArchiveMetaDataModificati
onTime
Metadata modification time. For internal use only. No
SystemMTime Set to e-mail receive time (SourceOne NPM property Id 4) Yes SystemCTime Set to e-mail receive time (SourceOne NPM property Id 4) Yes SystemCreationTime Set to e-mail receive time (SourceOne NPM property Id 4) Yes SystemATime Set to e-mail receive time (SourceOne NPM property Id 4) Yes SystemOwner Not relevant for SourceOne objects. Set to Kazeon. N/A
SystemExtension Not set for sourceone object N/A
SystemFilePath sourceone://<s1-repository-name>/<folder-id>/<object-id> Yes
SystemFileSize Size of SourceOne object Yes
OriginalsourceCustodian Multivalued field, Set to SourceOne Owner(s) by default. Yes OriginalsourceLocation sourceone://<s1-repository-name>/<folder-id>/<object-id> Yes SourceOneArchiveFileName Filename of a SourceOne archived object Yes SourceOneArchiveObjectType Type of objects in SourceOne archive Yes SourceOneArchiveOriginalFileLocation Location of file objects from where it’s been archived in SourceOne No SourceoneArchiveFileVersion File version of a Sharepoint object No SourceoneArchiveOriginalOwner The intersection of owner filter with all owners from SourecOne archive.If
no owner filter is specified then the value will be all owners belonging to an object in SourceOne
Custodian Mapping Changes in Web-Admin
Prior to release 4.7.0, the IS1200 extracts only a single owner (custodian) for each email classified. The custodian is determined according to the custodian mapping defaults set in. Because EMC SourceOne objects can have multiple owners (for journalize
messages), the EMC SourceOne Archive Connector enables the IS1200 to extract a new multi-valued metadata property that records multiple owners.
IS1200 4.7.0 and later introduces a way to restrict showing multiple custodians for a SourceOne object. Now, SourceOne owner filter is mandatory for every crawl. Custodian values are limited to only the values specified in the filter and never goes beyond that.
For example, if an email contains four owners as OwnerA, OwnerB, OwnerC, and OwnerD and you run a crawl with OwnerA and OwnerB as owner filters then, OwnerA and OwnerB are shown as Custodians for this object instead of listing all owners.
"SourceOneArchiveOwner" metadata field holds the owner
information which allows you to still search for all owners associated with an object. For more information, please see Table 2 on page 32. Now, the search result filter time is reduced because all the associated owners of an object are not present in the result filter.The Custodian filter is mandatory.
The following LDAP parameter can be used to disable/enable the mandatory custodian.
/module/config/sourceone/mandatorycustodian
By default the value of this LDAP parameter is “ON”, so that crawls on SourceOne cannot be run with owner filter.
To display the current value of this LDAP setting, use the following command:
sh config param /module/config/sourceone/mandatorycustodian
To change the default LDAP setting from ON to OFF do the following:
Email on EMC SourceOne repositories have the custodian field set to their owner(s) by default, however the default mapping can be overridden with exception setting.
Because EMC SourceOne emails may have multiple custodians, the following situations should be noted:
◆ If a search returns an email that contains multiple custodians, and that email is Copied, Moved, Downloaded, or Exported, the metadata created on the target will include all multiple custodians, subject to the following bullet.
◆ The default number of exportable custodians per email is
unlimited. However, a configuration parameter exists to limit this to a specified maximum. If this parameter is set, some emails may not have all their custodians recorded on the target metadata when exported.
◆ For example, if an email is sent to a distribution list with thousands of recipients (giving this email thousands of
custodians), and the limit is lower than the number of users in the distribution list, not all custodians will be recorded on the target.
◆ If the Lookup button is used to select custodians for a search or collection job, all the user addresses found will be searched for, however, if the results are exported the username metadata that is added to the target repository only contains the short name used in the Lookup button to search for those addresses.
For example, if the Lookup button is used to search for “Bob Smith”, and it returns “[email protected]”, “[email protected]”, and “[email protected]”, any emails found and exported will only have the metadata custodian “Bob Smith” recorded.
See “Selecting SourceOne Owners, the Lookup Button” on page 42 for more details on using the Lookup button.
Security and Permissions
EMC SourceOne Archive Connector does not enforce user-level access
checks or permission checks for user searches or user-initiated jobs and services.
Supported Jobs and Services
The following job types are supported:
◆ Deep classifications from Web-Admin and Processing from the
eDiscovery Case Manager
◆ Web-Admin and eDiscovery Case Manager collections may be
performed from EMC SourceOne repositories
◆ Web-Admin Metadata Classifications and Reclassifications.
◆ EMC SourceOne metadata may be deleted.
The following Actionable Services (actions) are supported for
Web-Search results:
◆ Tagging - adding or changing tags in the EMC SourceOne results metadata repository
◆ Copy FROM - results may be copied from EMC SourceOne repositories to registered IS1200 target repositories (CIFS, NFS, Centera, Celerra, etc).Copied e-mails are stored to the MSG format, and do not preserve any proprietary formats the EMC SourceOne repository may have used.
◆ Download - FROM only - search result files may be downloaded from EMC SourceOne repositories to registered IS1200 target repositories (CIFS, NFS, Centera, Celerra, etc) as Metadata in CSV format, or as native documents to ZIP files.
◆ Legal Export FROM only - results may be exported from EMC SourceOne repositories to load formats for other legal
applications, such as Concordance, to registered IS1200 target repositories (CIFS, NFS, Centera, Celerra, etc).
◆ Reclassify - result may be reclassified for more targeted analysis with different rule sets.
Setting SourceOne Jobs and Action Scope
EMC SourceOne servers may contain many terabytes of data. Consequently, when the IS1200 presents a dialog to select a EMC SourceOne repository—in job setup screens like deep crawls,
collections, or in-place processing—a mechanism to allow the user to specify sub-sections of the EMC SourceOne repository can be very helpful.
The IS1200 provides several ways to limit selection scope on a EMC SourceOne repository:
◆ Select one or more options from Email, Files, or Sharepoint
◆ Mapped Folders
◆ Mail Received Date Range
◆ SourceOne Display Name (For Email Only)
◆ File Accessed, Modified, and Created Date Range (for F iles and Sharepoint only)
◆ Owners
◆ One or more Keywords
These folders and filters work the same in all the job and action setup screens listed below:
◆ Web-Admin Deep Classifications
◆ Web-Admin Single-step Collections
◆ Web-Admin In-place Processing
◆ eDiscovery Case Manager Add Case: Repository Selection tabs
◆ eDiscovery Case Manager Add Collections tabs
The following sections describe these folders and filters.
Selecting SourceOne Mapped Folders
When the IS1200 presents a repository selection dialog, it lists all the registered EMC SourceOne repositories as possibilities, but allows restricting the selection to one or more mapped folder in those repositories.
The screen below shows a repository selection screen for EMC SourceOne repositories.
Figure 6 Web-Admin Single-step Collection Tab
Notice that the Show drop-down menu has EMC SourceOne selected as the repository type, and that each EMC SourceOne repository listing in the Available box is proceeded by a plus sign. When a EMC SourceOne server listing plus sign is clicked, that repository’s mapped folders are expanded beneath it.
Any mapped folder may be clicked, and then selected for the job or service by clicking the blue right-arrow to the right of the Available box.
Selecting SourceOne Email, Files, or Sharepoint
Another way to limit the selection scope on a EMC SourceOne repository is by selecting one or more options from Email, Files, or
Sharepoint objects. This is done with the include options shown
Note: Selecting one of the options from Emails, Files, or Sharepoint is mandatory.
If Email option is selected, then following type of objects are filtered from SourceOne archive:
Emails message, contact, Meeting, Task, PostedNote, JournaleEntry, StickyNote, Appointment, DeliveryReport, ReadReport, OtherReport, FaceTime, BloomBerg, AIKONIX, IMLOGIC, MINDALIGN, NOTICE, TaskRequest, UBSChat, Other
Note: Lotus Domino non-email object crawls are not supported.
If Files option is selected, then following type of objects are filtered from SourceOne archive:
Files, Other
If Sharepoint option is selected, then following type of objects are filtered from SourceOne archive:
Documents, Sharepoint Item, Wiki, Picture, Issues, Other Note: By default Email option is selected. But one or all of the three options can be selected.
Note: In Web-Search previewer can only render
JPeg,BMP,PNG,GIF,TIFF1,TIFF2 picture formats crawled from SouceOne archive which has sharepoint picture library's.
In the Web-Search to search for different types of SourceOne archive objects following object types can used in the filter
◆ Appointment: 67 ◆ Document: 73 ◆ Delivery Report: 74 ◆ Read Report: 75 ◆ Other Report: 76 ◆ Facetime: 77 ◆ Bloomberg: 78 ◆ Aikonix: 79 ◆ Imlogic: 80 ◆ Mindalign: 81 ◆ Notice: 82 ◆ Task Request: 83 ◆ UBS chat: 84 ◆ Other: 90 ◆ File: 3000 ◆ Sharepoint Item: 500 ◆ Wiki: 501 ◆ Picture: 502 ◆ Issues: 503
Selecting SourceOne Scope by Time
Note: Email Received Date Filter is displayed only if Email option is selected as mentioned in the above section “Selecting SourceOne Email, Files, or Sharepoint”.
between which you want to set the scope. For example, if you set a:
From Time of 1 Jan 2008 00:00:00 and To Time of 1 Apr 2009 00:00:00
then the filter will include messages which are sent
ON or AFTER 1 Jan 2008 00:00:00 and BEFORE 01 Apr 2009 00:00:00 (That is, all messages sent in the months Jan, Feb, and Mar.)
Alternately, enter a number (of days) in the field to limit the scope to messages sent or received between the time the job is run and that number of days ago, inclusive.
Note: Using this filter requires that the IS1200 was configured with the correct time zone during it’s initial setup. See the Using kaz_setup.pl section of the Configuring Server Network Connectivity chapter of the
IS1200 Installation and Quickstart Guide for details.
Selecting SourceOne Scope by Display Name
Note: SourceOne Display Name (For Email only) is displayed only if Email option is selected in the above section “Selecting SourceOne Email, Files, or Sharepoint” on page 37.
Another way to limit the selection scope on an EMC SourceOne repository is by display name. This filter allows you to filter the Emails by Sender or Recipient. This is done with the options shown below:
Search on domain names (such as [email protected]) is not allowed. You must enter the display name, not the SMTP address.
For example, if you enter Alana in the SourceOne Display Name field then the filter will include messages whose sender/recipient matches Alana like, Alana B Gaymon, Alana B Raysor, Alana, and so on. EMC SourceOne performs a contains search in this case. To filter multiple display names in the field SourceOne Display Name, a semi-colon separated list can be entered where the values are OR’d. Supported wild cards are at the beginning or end of the Display Name. For example, Alana*; *Cluff. Embedded wild cards are not supported.
To use SourceOne Display Name filter for terminated users (non AD users), turn off the mandatory LDAP parameter. Please see the section “Custodian Mapping Changes in Web-Admin” on page 33 to turn off the mandatory LDAP parameter.
Selecting SourceOne Scope by Date
Note: Time Filters (For Files and Sharepoint Only) is displayed only if Files or Sharepoint option is selected in the section “Selecting SourceOne Email, Files, or Sharepoint” on page 37.
Another way to limit the selection scope on an EMC SourceOne repository is by date. This filter allows you to filter the Files or Sharepoint items by date. This is done with the options shown below:
Click the calendar icons to open a dialog to choose dates for the following fields:
Last Accessed (for Files Only): The last date when a file was accessed. This field is not applicable for Sharepoint items.
Last Modified: The last date when a file or a Sharepoint item was modified.
Alternately, enter a number (of days) in the field to limit the scope.
Note: Filtering using this option requires full-text indexing enabled on all selected business folders in SourceOne.
Selecting SourceOne Owners, the Lookup Button
Another way to limit the selection scope on a EMC SourceOne repository is by user, owner, or custodian. This is done with the Owner Filter shown below.
The screen above is from a Web-Admin Single-Step Collection setup tab, but the Lookup button on it works the same everywhere it is found in Web-Admin, the eDiscovery Case Manager, and in Web-Search. Click the Lookup button to open the standard Lookup dialog.
Note: For the Lookup button to work correctly, the appropriate Active Directory authentication server must be registered with the IS1200. See “Ensuring IS1200 AD Authentication Server Works with Lookup Button” on page 19 for details.
Lookup is Active Directory based, Lookup for Lotus Domino users is not supported.
If manually entering Active Directory custodians, click the Add button to add a semi-colon separated list.
When manually entering a list of Domino custodians, use a semi-colon separated list in Notes Canonical format.
For example: CN=John Smith/OU=Boston/O=Company or
CN=John Smith/OU=Boston/O=Company;CN=John Smith/OU=SanJose/O=Company;CN=John A. Smith/ OU=Boston/O=Company;CN=Smith/OU=SanJose/O=Company To delete the custodians from the Owner Filter list, select the custodian in the grid and click Delete button.
Note: If mandatory custodian LDAP parameter is ON then the SourceOne Owner filter grid displays the custodian list for the previous crawl. In
Web-Admin, these preserved custodians are shown in repository level and in eDiscovery Case Manager they are shown against per case.
Selecting SourceOne Scope Using Keywords
Lastly, selection scope may be limited by defining keywords to filter using the section shown below.
Enter a space separated list of keywords to filter by.
Note: Keyword filtering will not work unless all mapped folders are fully indexed by the EMC SourceOne server.
This appendix discusses configuring EMC SourceOne servers to allow both basic and single sign on authentication. This section explains various options for setting up EMC SourceOne server authentication to allow the IS1200 to properly communicate with the SourceOne servers.
Topics include:
◆ IS1200 - SourceOne Authentication Requirements ... 46
◆ Use SSO for ExShortcut and Default (Anonymous) for Search and SearchWS Services... 46
◆ Use BOTH SSO and Default (Anonymous) Authentication on Separate Worker Sites ... 47
IS1200 - SourceOne Authentication Requirements
The EMC Kazeon - eDiscovery (IS1200) must be able to connect to its registered SourceOne repository’s Search and SearchWS sites in order to function properly with those servers. However, SourceOne servers can be configured to use either default(anonymous) or Single Sign On (SSO) authentication.
EMC SourceOne servers are sometimes configured to use SSO authentication to facilitate ExShortcut, see the EMC SourceOne 6.6
Installation Guide for details about when, why, and how ExShortcut is
used. For the IS1200 to function properly in this situation, the SourceOne server must be set up to use both default (anonymous) and SSO authentication, specifically, the Search and SearchWS sites should NOT be modified from their default configuration states. Neither
Windows Authentication nor Basic Authentication options should be
enabled for the SearchWS site.
This can be accomplished in the following two ways.
Use SSO for ExShortcut and Default (Anonymous) for Search and SearchWS Services
The ExShortcut (Mobile) website is not dependant on the Search or SearchWS sites, and can coexist when only ExShortcut uses SSO. If SSO authentication is needed only for ExShortcut, and not needed for other SourceOne services, then simply configure the ExShortcut (Mobile) website for SSO and the Search and SearchWS sites for default(anonymous).Please refer to the following figure of SearchWS service's authentication configuration:
Use BOTH SSO and Default (Anonymous) Authentication on Separate Worker Sites
If resources permit, simply create two instances of the SourceOne Web Services. Setup one to respond to SSO authentication, and the second to respond to default (anonymous) authentication. Then, configure the IS1200 to work with the default (anonymous)This chapter details procedures useful for troubleshooting IS1200 SourceOne data repositories.
Topics include:
◆ Handling SourceOne Archive Indexing Errors ... 50
◆ Fixing Result Filter Timeouts ... 52
◆ Errors When Registering SourceOne Server Data Repositories... 53
◆ Reconciling Object Counts for SourceOne Repository Collections and Classifications... 54
◆ SourceOne Connector Message Deduplication ... 55
The following issues may be encountered when SourceOne Servers are registered as IS1200 data repositories.
Handling SourceOne Archive Indexing Errors
While an IS1200 job or service is accessing a SourceOne Archive server using the SourceOne Connector, the SourceOne Archive may return index inconsistency errors to the connector. When this occurs, the connector forwards the error to the IS1200 and the IS1200 aborts the current job or service and records the error in the job log. When these errors occur, contact your SourceOne Archive
administrator to have them fixed, and then re-run the IS1200 job or service.
The following jobs or services may be aborted by these indexing errors:
◆ Deep Classification, Inplace processing, Single Step Collections, and Legal Application Collections on SourceOne archives without filters or with filters like date, owner, size etc.
◆ Inplace processing, Single Step Collections, and Legal
Application Collections on SourceOne archive using a SourceOne
keyword filter.
When these errors abort a SourceOne Archive job or service, the IS1200 reports the problem in several places:
◆ In Web-Admin,
◆ Also in the Web-Admin Job List tab,
clicking the Detail icon in the tool-bar produces the screen below:
◆ In the eDiscovery Case Manager, in the CaseDetail > Activities >
Collection Activities tab as shown below.
For Compound Jobs
If classifications include more than one URLs from one or more SourceOne Archive Servers, the following applies:
1. As soon as the classification detects an indexing error, the job is aborted. Only the URLs from the SourceOne Archives completely classified before the errors are detected are indexed.
2. The remaining URLs are not classified.
3. After the errors are fixed, the classification must be re-submitted for all SourceOne Archives that were not fully classified before the error aborted the job.
Fixing Result Filter Timeouts
Occasionally a search on a SourceOne repository may display only a limited number of results and when either a Results Filter tab filter is applied, or a Results Filters sub-heading’s “
View more
” link is clicked, a timeout error may display resulting in no results being returned. The timeout error can be managed in two ways:◆ Change the method the new results are displayed by
Changing the Filter Method
The IS1200 prepares search results displays two way, one called A1 and A2. A1 is the default for search results with less than 2000 listings, and for non-SourceOne search results is usually the faster method, which is why it is the default. However, for SourceOne searches, A2 is usually faster, so a configuration parameter is available to force all SourceOne result displays to use A2.
Be aware that A2 does not prevent timeouts, it just makes them less likely for SourceOne result displays. After switching to A2, timeouts can still occur for larger result sets.
To force A2 for SourceOne, do the following on all IS1200 nodes: 1. Open configuration file
/opt/openkaz/config/SearchGroupings.xml
2. Find the following line.
<common name="useA1" value="true" />
3. Change true to false so the line looks as follows.
<common name="useA1" value="false" />
4. Save the file.
5. Stop and re-start the cluster.
Changing the IS1200 System Timeout Value
See the Changing the Search and Review Filters Timeout Values heading in the Configuration Files and Utilities appendix of the IS1200
Web-Admin User and Configuration Guide.
Errors When Registering SourceOne Server Data Repositories
If the SourceOne Worker host does not have ASP.NET properly installed, or if IIS was installed after Windows Communication
Foundation (WCF) was installed, you may experience issues
connecting to the
ExDocMgmtSvc
hosted IIS service. If these issues occur while registering SourceOne servers as data repositories you may see error “405 – Method Not Allowed” like following as an example:-Failed to connect to ES1Archive '10.31.6.212' using
identity 'ID_ES1' : Failed to execute authenticate
operation on the Document Management service on host
10.31.6.212 HTTP Transport error : '405' - 'Method
Not Allowed'
The issue is explained in more detail in the following Microsoft support link:
http://msdn.microsoft.com/en-us/library/ms7522
52(v=vs.90).aspx
According to Microsoft, if the application mappings are not correct they may be remedied using the following Microsoft tools:
◆
aspnet_regiis.exe
◆
ServiceModelReg.exe
The link gives the following two examples for using these tool: (Procedures will vary based on IIS and .NET Framework versions)
On the Worker machine,
1. Register ASP.NET and enable the aspnet_isapi web service extension for IIS:
%WINDIR%\Microsoft.NET\Framework\v2.0.50727\aspn
et_regiis” –i –enable
2. Install and register the WCF Service Model extensions in IIS:
"%WINDIR%\Microsoft.Net\Framework\v3.0\Windows
Communication Foundation\ServiceModelReg.exe" –i
3. Now, try to register EMC SourceOne repository from IS1200.
Reconciling Object Counts for SourceOne Repository Collections and Classifications
If search result counts for email archives made by the SourceOne server do not match collection or classification result counts on a SourceOne data repository made by the IS1200, then check the following:If operating in the United States in an area were daylight savings time is in effect, you must also have an appropriate
timezone
set on the IS1200. See the Using kaz_setup.pl section of the Configuring ServerThe following are the more frequently used zones: US/Alaska US/Central
US/Eastern US/Hawaii US/Mountain US/Pacific
SourceOne Connector Message Deduplication
In some corner cases, duplicate messages can be archived in a SourceOne mapped folder resulting in search counts from the SourceOne server and Web-Search dis-agreeing.
This can be avoided using the LDAP configuration entry:
/module/config/sourceone/removeduplicates
By default this LDAP parameter is configured “off” and the IS1200 does not de-duplicate messages in SourceOne mapped folders. If set to “on”, then the IS1200 does de-duplicate messages in SourceOne mapped folders, however this takes additional processing time. To set the LDAP entry “on” use the following command:
set config param /module/config/sourceone/removeduplicates value "on"
To set the LDAP entry “off” use the following command:
set config param /module/config/sourceone/removeduplicates value "off"
To display the current value, use the following command: