Shows & Panels
- The 2014 Big Picture on Cyber Security
- AFCEA Answers
- Ask the CIO
- Building the Hybrid Cloud
- Connected Government: How to Build and Procure Network Services for the Future
- Continuing Diagnostics and Mitigation: Discussion of Progress and Next Steps
- Federal Executive Forum
- Federal Tech Talk
- The Future of Government Data Centers
- The Future of IT: How CIOs Can Enable the Service-Oriented Enterprise
- The Intersection: Where Technology Meets Transformation
- Maximizing ROI Through Data Center Consolidation
- Mitigating Insider Threats in Virtual & Cloud Environments
- Modern Mission Critical Series
- Moving to the Cloud. What's the best approach for me
- Navigating Tough Choices in Government Cloud Computing
- The New Generation of Database
- Satellite Communications: Acquiring SATCOM in Tight Times
- Targeting Advanced Threats: Proven Methods from Detection through Remediation
- Transformative Technology: Desktop Virtualization in Government
- The Truth About IT Opex and Software Defined Networking
- Value of Health IT
- Air Traffic Management Transformation Report
- Cloud First Report
- General Dynamics IT Enterprise Center
- Gov Cloud Minute
- Government in Technology Series
- Homeland Security Cybersecurity Market Report
- National Cybersecurity Awareness Month
- Technology Insights
- The Cyber Security Report
- The Next Generation Cyber Security Experts
Shows & Panels
Library of Congress to receive entire Twitter archive
Tuesday - 12/6/2011, 1:51pm EST
Federal News Radio
The Library of Congress and Twitter have signed an agreement that will see an archive of every public Tweet ever sent handed over to the library's repository of historical documents.
"We have an agreement with Twitter where they have a bunch of servers with their historic archive of tweets, everything that was sent out and declared to be public," said Bill Lefurgy, the digital initiatives program manager at the library's national digital information infrastructure and preservation program. The archives don't contain tweets that users have protected, but everything else — billions and billions of tweets — are there.
Lefurgy joined the Federal Drive with Tom Temin and Amy Morris Tuesday morning to talk about the library's digital mission.
Using new technical processes it has developed, Twitter is moving a large quantity of electronic data from one electronic source to another. "They've had to do some pretty nifty experimentation and invention to develop the tools and a process to be able to move all of that data over to us," Lefurgy said.
The Library of Congress has long been the repository of important, historical documents and the Twitter library, as a whole, is something historic in itself.
"We were excited to be involved with acquiring the Twitter archives because it's a unique record of our time," Lefurgy said. "It's also a unique way of communication. It's not so much that people are going to be interested in what you or I had for lunch, which some people like to say on Twitter."
Researchers will be able to look at the Twitter archive as a complete set of data, which they could then data-mine for interesting information.
"There have been studies involved with what are the moods of the public at various times of the day in reaction to certain kinds of news events," Lefurgy said. "There's all these interesting kinds of mixing and matching that can be done using the tweets as a big set of data."
One benefit for the Library of Congress in receiving this large data set is that it's been forced to stretch itself technologically.
"It's been difficult at times," Lefurgy said. "But we firmly believe that we have to do this kind of thing because we anticipate that we'll be bringing in large data sets again into the future. We don't know specifically what, but certainly there's no sign of data getting smaller or less complicated or less interesting."
The library's Twitter partnership comes amid a renewed push by the administration and the National Archives and Records Administration for federal agencies to better archive their own social media postings and emails as potential government records.
"We're basically in the same situation as the National Archives, only on a much larger scale," Lefurgy said. "We tend to have a much larger perspective in terms of what we collect."