Skip to content

Commit 93d0bde

Browse files
committed
doc: added FAQ
1 parent 8d04075 commit 93d0bde

File tree

2 files changed

+28
-0
lines changed

2 files changed

+28
-0
lines changed

doc/FAQ.rst

Lines changed: 27 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,27 @@
1+
==========================
2+
Frequently Asked Questions
3+
==========================
4+
5+
Can I extract all images from MS OLE2 documents with olefile?
6+
-------------------------------------------------------------
7+
8+
Not directly: images are not always stored the same way, and it also depends on the format.
9+
10+
For example in Powerpoint presentations, you may find a stream named "Pictures"
11+
when running "olefile yourfile.ppt". You may extract the stream by using the
12+
openstream() method on the OleFileIO object, but you will usually get a binary
13+
stream containing several picture files. You may also extract it manually using
14+
tools such as SSView (http://www.mitec.cz/ssv.html).
15+
16+
Then the only way I've found so far is to use file carving tools which are
17+
able to determine the beginning and the end of each picture in a binary file.
18+
These tools are not always easy to use but if you're interested have a look
19+
at http://pypi.python.org/pypi/hachoir-subfile
20+
and http://www.forensicswiki.org/wiki/Tools:Data_Recovery#Carving.
21+
22+
If you really need to automate the process then you have to study Microsoft
23+
specifications (at http://www.microsoft.com/interop/docs/officebinaryformats.mspx)
24+
and find the right way to parse MS Office documents...
25+
26+
A lot of people (including me) would be very interested if you find a solution! ;-)
27+

doc/index.rst

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -35,6 +35,7 @@ Microscopy file formats, McAfee antivirus quarantine files, etc.
3535
Howto
3636
OLE_Overview
3737
olefile
38+
FAQ
3839

3940

4041

0 commit comments

Comments
 (0)