
<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>eDiscoverySquad Litigation Support Services &#187; library</title>
	<atom:link href="http://ediscoverysquad.com/Services/library/feed/" rel="self" type="application/rss+xml" />
	<link>http://ediscoverysquad.com</link>
	<description>eDiscoverySquad is a litigation technology and service firm that specializes in helping law firms and their clients</description>
	<lastBuildDate>Thu, 24 Aug 2023 21:02:13 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.6</generator>
		<item>
		<title>Free Email eDiscovery/Archiving data extractor used by Library of Congress</title>
		<link>http://ediscoverysquad.com/free-email-ediscoveryarchiving-data-extractor-used-by-library-of-congress/</link>
		<comments>http://ediscoverysquad.com/free-email-ediscoveryarchiving-data-extractor-used-by-library-of-congress/#comments</comments>
		<pubDate>Thu, 18 Oct 2012 16:30:04 +0000</pubDate>
		<dc:creator>murari</dc:creator>
				<category><![CDATA[eDiscoverySquad Consulting]]></category>
		<category><![CDATA[congress]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[ediscoveryarchiving]]></category>
		<category><![CDATA[email]]></category>
		<category><![CDATA[extractor]]></category>
		<category><![CDATA[free]]></category>
		<category><![CDATA[library]]></category>

		<guid isPermaLink="false">http://ediscoverycloud.net/?p=686</guid>
		<description><![CDATA[<p>I have recently doing some research on open source email/data extraction utilities, and ran across this one (PEDALS) that is used by the Library of Congress for archiving purposes. http://www.digitalpreservation.gov/news/2010/20100924news_article_pedals_email_tool.html I tested it and was pleased by how it performed. &#8230; <a href="http://ediscoverysquad.com/free-email-ediscoveryarchiving-data-extractor-used-by-library-of-congress/">Continue reading <span class="meta-nav">&#8594;</span></a></p><p>The post <a href="http://ediscoverysquad.com/free-email-ediscoveryarchiving-data-extractor-used-by-library-of-congress/">Free Email eDiscovery/Archiving data extractor used by Library of Congress</a> appeared first on <a href="http://ediscoverysquad.com">eDiscoverySquad Litigation Support Services</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>I have recently doing some research on open source email/data extraction utilities, and ran across this one (PEDALS) that is used by the Library of Congress for archiving purposes.</p>
<p><a title="Email extractor" href="http://www.digitalpreservation.gov/news/2010/20100924news_article_pedals_email_tool.html" target="_blank">http://www.digitalpreservation.gov/news/2010/20100924news_article_pedals_email_tool.html</a></p>
<p>I tested it and was pleased by how it performed.  My testing was performed on Microsoft .PST files.  PEDALS parses through the .PST files, and enumerates each .MSG (email), and extracts the associated attachments.   The email body and metadata are stored in an XML file.</p>
<p>To make this useful for litigation review platforms you need to convert the XML to a .CSV file, and store the Email body as extracted text (e.g., a .txt or rich text file).   At this point you could cull the data with a Desktop version of dtSearch, or you can then convert all of the extracted text and attachments to the target media (e.g., searchable postscript based PDF&#8217;s with optimized compression for web based download) and load them into the target Litigation Review software you plan to use.</p>
<p>Of course this process would be easier with some accompanying utilities to make those conversions automatically, and generate the target load file.</p>
<p>Fear not, eDiscoverySquad will be working on making this process easier in the future so be sure to watch our Blog for future updates.</p>
<p>&nbsp;</p>
<p>The post <a href="http://ediscoverysquad.com/free-email-ediscoveryarchiving-data-extractor-used-by-library-of-congress/">Free Email eDiscovery/Archiving data extractor used by Library of Congress</a> appeared first on <a href="http://ediscoverysquad.com">eDiscoverySquad Litigation Support Services</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://ediscoverysquad.com/free-email-ediscoveryarchiving-data-extractor-used-by-library-of-congress/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

 Served from: ediscoverysquad.com @ 2026-04-29 16:32:28 by W3 Total Cache -->