Skip to content

The Discovery Standard

Discovery News

The Discovery StandardSubscribe

The Discovery Standard

Skip to content

Guest Articles

Form of Production, Metadata and Embedded Data after the 2006 E-Discovery Amendments

Rule 34 provides for discovery and production of all types of electronically stored information – whether visible content, embedded data or metadata[1]  - but the Rule does not regulate the extent, if any, that metadata and embedded data must be produced in any particular case.[2]  The focus in Rule 34 is on the form or forms of production of the electronically stored information is sought in discovery.[3]  While this necessarily implicates issues about metadata and embedded data, it is not synonymous with it.


Electronically stored information is created and stored in a “native” file format which uniquely reflects the software application operating in conjunction with a computer’s file system.  This “native” format contains information apparent to the user, but also may carry embedded data and metadata accessible only to the application that created it.  While some information (e.g., file name, size and date information) is routinely furnished in discovery, other types of metadata and embedded data are rarely relevant and present significant functional and practical drawbacks when produced as part of a native file format.[4]  The production of metadata and embedded data also raises potential ethical and practical issues which may complicate privilege review.[5] 


Rule 34(b) outlines a procedure for reaching agreement on the form or forms of production of electronically stored information.  As a matter of best practice, and pursuant to the intent of Rule 26(f) the issues should be discussed promptly at the initial early meet and confer.  In Kentucky Speedway, LLC v. National Association of Stock Car Auto Racing, Inc.,[6] for example, the court stressed the role of Rule 26(f) under the 2006 Amendments in refusing a belated attempt to seek metadata.  A party seeking production of metadata or embedded data should therefore identify its interest in its production promptly so as to allow the producing party to attempt to accommodate the request or raise an appropriate objection, as is implicitly provided in Rule 34(b).[7]  An early and practical agreement is the preferred outcome,[8] and failure to discuss the issue or make a specific request waives objection to production without the specific metadata or embedded data.[9]  Typically, where the issue is contested in a timely fashion, courts require a showing of particularized need[10]  or relevance[11] before ordering production of metadata and embedded data.[12]  Even if relevant, the cost-benefit limitations under Rule 26(b)(2)(C) may apply.[13] 


Under amended Rule 34(b), if a party has not requested a particular form or forms of production and no agreement or a court order exists, a producing party has the option to produce either in the form or forms “in which [electronically stored information] is ordinarily maintained” or in a “reasonably useful” format.  The reference to the form in which it is maintained appears to encompass production in some variation of native file format with appropriate metadata and embedded data determined by the type of ESI involved.[14]  The “reasonably useable” option allows for production in other formats,[15] such as hard copy or imaged formats provided that any metadata or embedded data necessary to make the format comparably useful to the way it is available to the producing party, especially in regard to search capabilities, is also furnished.[16]


The need for metadata and embedded data varies depending upon the type of electronically stored information involved and the issues in the case.  It is very difficult to generalize about the topic, although valiant efforts have been made.[17]  In actual practice, the topic is negotiated on a case-by-case basis.[18]  Parties frequently agree to produce email in convenient and difficult to alter forms that faithfully preserve the appearance of the content so that the images of individual pages can be Bates numbered and readily used in depositions and at trial.[19]  Such page image production formats, principally the Tagged Image File Format (“TIFF”) and Adobe Portable Document Format (“PDF”) are typically accompanied by “load files,” i.e., ancillary files containing textual content and relevant metadata.  In other cases, such as production of spreadsheets, sound recordings, animated content, web pages, video and drawings (which do not lend themselves to production in imaged form) or large databases, production is often best accomplished in “native” or “quasi-native” file formats because of the complexity involved.[20]


In any event, absent an agreement to the contrary, it is advisable to preserve sources of ESI covered by a litigation hold in their native file formats if there is any possibility that metadata or embedded data relating to that electronically stored information may be deemed material.  This preserves the ability to prepare an appropriate extract of any metadata which may be required or to make production in some variant of a native file format or in native file format.  In the leading case of In re Priceline.Com Inc., Sec. Litig.,[21] the court ordered production in “TIFF” and “PDF form but required the original data be maintained in its original native file format for the duration of the litigation.  Indeed, a failure to maintain ESI in native format pending production arguably constitutes spoliation by a party on notice of its possible use, since discoverable metadata and embedded data are no different than other forms of ESI.[22] 


 


[1] For purposes of this article, I adopt the distinction between metadata and embedded data used by the Advisory Committee.  See Committee Note, Fed. R. Civ. P. 26(f): 

For example, production may be sought of information automatically included in electronic files but not apparent to the creator or to readers.  Computer programs may retain draft language, editorial comments, and other deleted matter (sometimes referred to as “embedded data” or “embedded edits”) in an electronic file but not make them apparent to the reader.  Information describing the history, tracking, or management of an electronic file (sometimes called ‘metadata’) is usually not apparent to the reader viewing a hard copy or a screen image. 
 

[2] See Williams v. Sprint/United Management Company, 2005 U.S.LEXIS 21966 **30 (D. Kan. Sept. 29, 2005)(Although Rule 34(b) uses the phrase “in a form or forms in which it is ordinarily maintained,” [the Rule and Comments] provide no further guidance as to whether a party’s production . . .would encompass the electronic document’s metadata”).

[3] See Fed. R. Civ. P. 34(b)(ii)([Unless the parties otherwise agree, or the court otherwise orders:] . . if a request does not specify the form or forms for producing electronically stored information, a responding party must produce the information in a form or forms in which it is ordinarily maintained or in a form or forms that are reasonably usable”).

[4] Individual pages from documents produced in native file formats are difficult or impossible to redact or Bates number and are more difficult to use in depositions or at trial than imaged formats.  A privilege review can also be more difficult to accomplish in a timely and cost effective manner. 

[5] See ABA Formal Opinion 06-442 (Review and Use of Metadata)(advocating scrubbing of metadata to avoid disclosures) but compare the Maryland and New York ethics opinions summarized in John F. Baughman and H. Christopher Boehning, Metadata Ethics [and] Electronic Discovery,” New York Law Journal (Dec. 26, 2006)(cite as: 12/26/2006 N.Y.L.J. 5 (col. 1).

[6] Civ. Action No. 05-138-WOB, 2006 U.S. Dist. LEXIS 92028 (Dec. 18, 2006)(“the issue of whether metadata is relevant or should be produced is one which ordinarily should be addressed by the parties in a Rule 26(f) conference”).

[7] See Fed. R. Civ. P. 34(b)(“If objection is made to the requested form or forms for producing electronically stored information – or , if no form was specified in the request – the responding party must state the form or forms it intends to use.”).

[8] See In re: Celexa and Lexapro Prod. Liab. Litig., supra, 2006 U.S. Dist. Lexis 91590 (E.D. Mo. Nov. 13, 2006) (involving a comprehensive agreement to produce ESI in “any format that generally is searchable and manageable (including native file format or as single page TIFF images with ASCII Text . . . and the following, to the extent applicable, as metadata: author, recipient, date, subject line)”).

[9] Kentucky Speedway, LLC v. National Association of Stock Car Auto Racing, Inc., supra, 2006 U.S. Dist. LEXIS 92028 (Dec. 18, 2006)( Rule 34(b) does not require production of metadata absent a showing of a particularized need and failure to raise issue prior to production waives objection); compare In re Payment Card Interchange Fee and Merchant Discount Antitrust Litigation, No. MD 05-1720(JG)(JO), 2007 U.S. Dist. LEXIS 2650 (E.D.N.Y. Jan 12, 2007), (refusing to find estoppel where production was made without objection ).

[10]  See Wyeth v. Impax Laboratories, No. Civ. A. 06-222-JJF, 2006 U.S. Dist. Lexis 79761 (D. Del. Oct. 26, 2006)(production in native format not required in the absence of foreseeable or necessary requirement for accessing metadata); accord, Sedona Principle 12 (“Unless it is material to resolving the dispute, there is no obligation to preserve and produce metadata absent agreement of the parties or order of the court.”).    

[11] See Hagenbuch v. 3B6 Sistemi Elettronici Industriali S.R.L., No. 04 C 3109, 2006 2006 U.S. Dist Lexis 10838 (N.D. Ill. March 8, 2006)(metadata ordered produced because relevant to establishing chronology of case).

[12] Some decisions requiring production in native format do not explain the basis for their ruling.  See Nova Measuring Instruments Ltd. v. Nanometrics, Inc, 417 F. Supp.2d 1121 (N.D. Cal. March 6, 2006)(ordering production in native format because producing party “offers no reason why” the order should not issue); accord, In re Verisign, Inc. Securities Litigation, No. C 02-02270 JW, 2004 U.S. Dist. Lexis 22467 (March 10, 2004)(upholding prior order of magistrate judge as not clearly erroneous because redaction and bates numbering difficulties do not “transcend all reasonableness.”).   It was possible to read the former Rule 34 as requiring production of “identical” copies, including information not ordinarily visible to a viewer.  See Hagenbuch v. 3B6 Sistemi Elettronici Industriali S.R.L, supra, 2006 2006 U.S. Dist. Lexis 10838 at *7-8 (N.D. Ill. March 8, 2006).  Amended Rule 34(b) clarifies that this is only one of several options available to a producing party in the absence of a request and an agreement or court order to that effect.  

[13] See Suggested Protocol for Discovery of Electronically Stored Information, U.S. Dist. Maryland (2007), at p. 25 (“[M]eta-data, however, may not be relevant to the issues present or, if relevant, not be reasonable subject to discovery given the Rule 26(b)(2)(C) cost-benefit factors.”), available at http://www.mdd.uscourts.gov/news/news/ESIProtocol.pdf.  The Suggested Protocol is a “working model” not adopted by the District Court which is offered to assist counsel.  Id, p. 1.

[14] SeeThomas Y. Allman, The Impact of the Proposed Federal E-Discovery Rules, 12 Rich. J. L. & Tech. 13 at *21, n. 72 (2006) (noting the observation by some Advisory Committee members that to “technically adept lawyers and experts” the reference could include metadata and embedded data).

[15] See Production – Form of Production, EDRM Project (differentiating between production in Paper, Quasi-Paper, Quasi-Native and Native), available at http://www.edrm.net/wiki/index.php/Production_-_Form_of_Production.

[16] See Committee Note, Rule 34(b)(“ But [this option] does not mean that a responding party is free to convert electronically stored information from the form in which it is ordinarily maintained to a different form that makes it more difficult or burdensome for the requesting party to use the information efficiently in the litigation.   If the responding party ordinarily maintains the information it is producing in a way that makes it searchable by electronic means, the information should not be produced in a form that removes or significantly degrades this feature.”).  See In re Payment Card Interchange Fee and Merchant Discoount Antitrust Litigation, supra, 2007 WL 121426 at *4 (implying, erroneously, in the author’s view, that searchable TIFF images are not reasonably usable within the meaning of the Committee Note).

[17] See Para . 11, Principles re: Meta-Data, Maryland Protocol, supra, n. 13 at p. 26, suggesting that “consideration” should be given to producing information such as author, date of creation and modification dates, but that metadata and embedded data need not be “routinely” produced, except that (at p. 28), certain highly technical embedded data, such as spreadsheet formulas, hidden columns, etc. should be produced in “appropriate” cases.   The Protocol also contains a complex and highly convoluted procedure designed to accommodate concerns about ethical issues involving review of embedded data by parties that should not ethically do so. (requiring a promise not to either access or review the data)(at p. 26).

[18] See Production – Form of Production, EDRM Project, supra, n. 15 (“Due to the Proposed [now effective] Amendments, it is becoming more common for opposing parries to produce the extracted text and metadata from electronic documents in a negotiated database format along with the document images.”).

[19] See The Ponca Tribe of Indians of Oklahoma v. Continental Carbon Company, No. CIV -05-445-C, 2006 U.S. Dist. Lexis  (W.D. Okla. Oct. 11, 2006)(approving production of the “equivalent of pictures of the emails” and denying request for production in “their native electronic format”).

[20] See Production – Form of Production, EDRM Project, supra (noting that production of an IBM AS400 database could be made as an ASCII, comma-delimited file with associate file and field structural information, i.e., as a “Quasi-Native” format), http://www.edrm.net/wiki/index.php/Production_-_Form_of_Production.

[21] 233 F.R.D. 88, 91 (D. Conn. 2005). 

[22] See, passim,  In re Telxon Corp. Securities Litigation, No. 5:98CV2876, 1:01CV1078, 2004 WL 3192729 (N.D. Ohio 2004)(noting concern over explanations for changes in metadata).

Author Bio: Thomas Y. Allman is Senior Counsel to Mayer, Brown, Rowe & Maw LLP and previously served as Senior Vice President, General Counsel and Secretary of BASF Corporation.

Test 1
Test 2
Test 3