Processing: the blender of e-discovery

Many lawyers and clients don’t understand this expensive phase or its impact

One of the most expensive technical portions of e-discovery is something called “processing.” Nontechnical personnel involved in e-discovery, including many lawyers, have only the vaguest notion of what is done in the processing phase. They know that collected data goes in one end and comes out the other ready for review, but what happens in between is a mystery. Yet the processing phase involves choices that can significantly affect what electronically stored information (ESI) is available for review and production, and what that review and production will cost.

The general objectives of processing include identifying exactly what elements or items of ESI have been submitted for processing, including their associated metadata. This allows intelligent and informed decisions to be made that can reduce the volume of data selected for continuation along the path to review. At the same time, processing technology and analysis must be applied to the data under strict standards of quality control and with chain-of-custody requirements in mind.

When data is submitted for processing, it is likely to consist of a variety of types or formats, such as word processing documents, backup files, email files, etc. It is also common for data, including email, to be stored in “container” files, such as .zip files or .pst files, which require extraction of the individual files and emails from their containers. Backup data may need to be restored. Moreover, some data, like data in obsolete formats, may need to be converted before further processing can occur. Each file must be captured along with associated metadata and all of this information must be catalogued.
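As a rough illustration of the extraction and cataloguing step, the sketch below opens a .zip container and records each file it holds along with basic metadata. It is a minimal sketch, not a description of any particular processing product: the function name `catalogue_zip` and the catalogue fields are assumptions made for this example, and real tools also handle .pst files, nested containers and far richer metadata.

```python
from __future__ import annotations

import hashlib
import io
import zipfile


def catalogue_zip(container_name: str, container_bytes: bytes) -> list[dict]:
    """Extract every file from a .zip container and catalogue it.

    The record fields are illustrative assumptions, not an industry standard.
    """
    records = []
    with zipfile.ZipFile(io.BytesIO(container_bytes)) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue  # directories carry no content of their own
            data = archive.read(info)
            records.append(
                {
                    "container": container_name,   # where the item came from
                    "path": info.filename,         # location inside the container
                    "size_bytes": info.file_size,  # uncompressed size
                    "modified": info.date_time,    # (year, month, day, h, m, s)
                    "sha256": hashlib.sha256(data).hexdigest(),
                }
            )
    return records
```

Recording a hash of each item at this stage gives later steps, such as de-duplication and chain-of-custody checks, a fixed fingerprint to work from.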

Opportunities then arise to reduce the volume of data, making review less expensive and in some cases reducing the risk of inconsistent review decisions about the same documents. For example, the data set can be “de-duplicated” in various ways, and “near de-duplication,” or the identification of similarity or common “concepts” among documents, can be achieved. Full-text indexing makes the data searchable, and search terms can be applied to help separate out clearly irrelevant data.
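One common way to picture exact de-duplication is to compare a cryptographic hash of each item's content. The sketch below assumes that approach and uses SHA-256; the function names are hypothetical, and it does not attempt near de-duplication or concept analysis, which rely on different techniques.

```python
import hashlib
from collections import defaultdict


def content_hash(data: bytes) -> str:
    """Return a SHA-256 fingerprint of an item's content."""
    return hashlib.sha256(data).hexdigest()


def deduplicate(items):
    """Split (item_id, content) pairs into unique items and exact duplicates.

    Returns a list of the first-seen item IDs and a mapping from each
    retained item ID to the IDs of later items with identical content.
    """
    first_seen = {}                 # hash -> ID of the item that is kept
    duplicates = defaultdict(list)  # kept ID -> IDs suppressed as duplicates
    for item_id, data in items:
        digest = content_hash(data)
        if digest in first_seen:
            duplicates[first_seen[digest]].append(item_id)
        else:
            first_seen[digest] = item_id
    return list(first_seen.values()), dict(duplicates)
```

Keeping the mapping of suppressed duplicates, rather than simply discarding them, preserves a record of which custodians or locations held each copy.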

Typically, certain files will cause problems for the initial application of processing technology; potential examples would include password-protected files or corrupt files. These are called “exceptions,” and decisions need to be made as to how such exceptions are to be handled. For example, to what extent will efforts to crack passwords be pursued? Where exceptions are not resolved or identified, some potentially relevant ESI may never see the light of day.
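The point about unresolved exceptions can be made concrete with a short sketch. It assumes a hypothetical `extract_text` function, supplied by the caller, that raises an error when a file is corrupt or password-protected. The sketch logs each failure instead of silently dropping the item, so that someone can later decide how to handle it.

```python
def process_with_exceptions(items, extract_text):
    """Run extract_text over (item_id, content) pairs, logging failures.

    extract_text is a hypothetical caller-supplied function; any error it
    raises is recorded as an exception entry rather than discarded.
    """
    processed = {}
    exceptions = []
    for item_id, data in items:
        try:
            processed[item_id] = extract_text(data)
        except Exception as error:  # broad on purpose: every failure is logged
            exceptions.append(
                {"item": item_id, "reason": f"{type(error).__name__}: {error}"}
            )
    return processed, exceptions
```

The resulting exception list is what lets the legal team see which items were never processed and decide whether further effort, such as password recovery, is justified.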

Some or all of the data may need to be transformed into other formats for purposes of review, depending on the characteristics of the review software that will be used. At this point, quality assurance procedures should be implemented. These might include, for example, looking at samples of the processed data and comparing this output with expectations based on information available before processing.

Another important element of processing from beginning to end is reporting. For example, each element of data should be tracked through each step in the process, and all decision making with respect to selectively reducing the volume of data should be documented. Information as to the impact of those decisions on the universe of data should be readily available to help inform decision making.
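A very simple form of such reporting is a running log of how many items entered and left each processing step. The sketch below is illustrative only: the class names, field names and stage labels are assumptions, and a real report would track individual items as well as counts.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class StageRecord:
    stage: str       # name of the processing step
    items_in: int    # items entering the step
    items_out: int   # items remaining after the step
    note: str = ""   # the decision or criterion applied

    @property
    def removed(self) -> int:
        return self.items_in - self.items_out


@dataclass
class ProcessingReport:
    stages: list = field(default_factory=list)

    def record(self, stage: str, items_in: int, items_out: int, note: str = "") -> None:
        """Document one volume-reduction decision and its impact."""
        self.stages.append(StageRecord(stage, items_in, items_out, note))

    def summary(self) -> list[str]:
        """Return one human-readable line per recorded stage."""
        lines = []
        for s in self.stages:
            line = f"{s.stage}: {s.items_in} in, {s.items_out} out, {s.removed} removed"
            if s.note:
                line += f" ({s.note})"
            lines.append(line)
        return lines
```

For instance, calling `report.record("de-duplication", 1000, 800, "exact SHA-256 match")` on a `ProcessingReport` would document that the step removed 200 items and why.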

In cases with substantial volumes of electronic information, data that is collected must be processed in software designed for discovery purposes before it can be ready for review by attorneys. Processing is relatively expensive as far as the technical elements of e-discovery are concerned, but many lawyers and clients do not understand what it is or how it impacts discovery. There is no need for attorneys to become experts in the minutiae of data-processing technology, but a grasp of its major components can help in understanding the life cycle of data in e-discovery.

About the Author
Adam Cohen

Adam Cohen is a Principal with Ernst & Young LLP. He is the co-author of the annually updated legal treatise “Electronic Discovery: Law and Practice,” as well as the forthcoming “Social Media: Managing Legal Risk Through Corporate Policy.” He is also the co-chair of the New York State Bar Association’s eDiscovery Committee and teaches electronic discovery at Fordham Law School.

