Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Save more on your purchases! discount-offer-chevron-icon
Savings automatically calculated. No voucher code required.
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletter Hub
Free Learning
Arrow right icon
timer SALE ENDS IN
0 Days
:
00 Hours
:
00 Minutes
:
00 Seconds
Arrow up icon
GO TO TOP
Python Digital Forensics Cookbook

You're reading from   Python Digital Forensics Cookbook Effective Python recipes for digital investigations

Arrow left icon
Product type Paperback
Published in Sep 2017
Publisher Packt
ISBN-13 9781783987467
Length 412 pages
Edition 1st Edition
Languages
Tools
Concepts
Arrow right icon
Authors (2):
Arrow left icon
Chapin Bryce Chapin Bryce
Author Profile Icon Chapin Bryce
Chapin Bryce
 Miller Miller
Author Profile Icon Miller
Miller
Arrow right icon
View More author details
Toc

Table of Contents (18) Chapters Close

Title Page
Credits
About the Authors
About the Reviewer
www.PacktPub.com
Customer Feedback
Dedication
Preface
1. Essential Scripting and File Information Recipes FREE CHAPTER 2. Creating Artifact Report Recipes 3. A Deep Dive into Mobile Forensic Recipes 4. Extracting Embedded Metadata Recipes 5. Networking and Indicators of Compromise Recipes 6. Reading Emails and Taking Names Recipes 7. Log-Based Artifact Recipes 8. Working with Forensic Evidence Container Recipes 9. Exploring Windows Forensic Artifacts Recipes - Part I 10. Exploring Windows Forensic Artifacts Recipes - Part II

Reading office document metadata


Recipe Difficulty: Medium

Python Version: 2.7 or 3.5

Operating System: Any

Reading metadata from office documents can expose interesting information about the authorship and history of those files. Conveniently, the 2007 formatted .docx, .xlsx, and .pptx files store metadata in XML. The XML tags can be easily processed with Python.

Getting started

All libraries used in this script are present in Python's standard library. We use the built-in xml library and the zipfile library to allow us access to the XML documents within the ZIP container.

Note

To learn more about the xml library, visit https://docs.python.org/3/library/xml.etree.elementtree.html. To Learn more about the zipfile library, visit https://docs.python.org/3/library/zipfile.html.

How to do it...

We extract embedded Office metadata by performing the following steps:

  1. Confirm that the input file is a valid ZIP file.
  2. Extract the core.xml and app.xml files from Office file.
  3. Parse XML data and print embedded metadata...
lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime
Visually different images