Mbox2xml
Mbox2xml 2 Released!!
After well over 12,000 downloads of Mbox2xml, version 2 is finally released!
Mbox2xml 2 has better support for various character sets, generates more user friendly webpages and much more!
Scroll down to the download section now!
What is Mbox2xml?
Mbox2xml is an email archiver for Mozilla Thunderbird, which extracts messages from the mbox format used by Mozilla Thunderbird and stores them in an XML format, which can be viewed as web pages in a browser.
Backing up emails can be a cumbersome task, and the results are often not very user friendly nor easily accessible. These are the two main reasons why Mbox2xml was created.
We thought an email archiving tool should have the following properties:
- Easy to use.
- The archives should be easily accessible on any computer from any data storage device.
- The tool should allow for automation so it can easily be integrated with existing back-up methods.
Therefore it was decided that the program should store the exported emails in an XML format which, together with a provided stylesheet, can be viewed in any modern webbrowser, no matter where the backup is stored. Also, XML is easily imported into existing software solutions. And finally, Mbox2xml has both a graphical user interface for ease of use, and a command line version for easier automation.
TOPDownload
Mbox2xml does not contain any spyware, viruses, trojans nor other unpleasantries. Mbox2xml may be used free of charge in a personal, non-commercial setting. Please contact us at tools@elit.nl before using it in any other situation.
Mbox2xml version 2.03.0
| download | last update | size |
|---|---|---|
| Mbox2xml windows installer This installer can be run in 'silent' mode from the command line by providing the /S parameter. Add /D=<path> to install to a non-default location. | 2013-05-18 10:37:36 | 7.66 MB |
| Mbox2xml 7-zip archive This archive, in 7-Zip format, can be used for manual installation. | 2013-05-18 08:34:14 | 5.30 MB |
If you like Mbox2xml, then please consider supporting us with a small donation.
The Mbox Format
The email client for which Mbox2xml is initially designed, is Mozilla Thunderbird. This popular program stores the emails it receives in a format called Mbox, which is widely used by other mail programs. Although untested, Mbox2xml should be able to export emails from any Mbox file, whether it's created by Thunderbird or not.
Where to find the Mbox files?
In Mozilla Thunderbird, go to the 'tools' menu, and select 'Account Settings'. You'll see a configuration screen with settings for each email account. Select the 'Server Settings' for each account, to see where the emails are stored locally. Mbox files have no extension, and are named like the folders in Thunderbird (e.g. 'Inbox'). Mbox2xml can also search for Mbox files. See the manual section for more details.
Finally, there are two things you need to keep in mind:
- Thunderbird stores sent emails in a seperate Mbox file by default. Make sure you export both the Inbox and Sent folder if you want Mbox2xml to reconstruct your email conversations.
- Before using Mbox2xml, please make sure you let Thunderbird compact your mail folders. This will make Thunderbird remove 'junk' and prevent it from showing up in your export. If old junk mail still keeps lingering, please have a look here.
Features
Mbox2xml has the following features:
- Extracts emails, including attachments, from one or more mbox files and stores them in XML files and attachment directories respectively.
- Can generate an XSL stylesheet which transforms the XML into an easy to use webpage, which allows you to sort and browse the messages by date and thread.
- Contains both a commandline version and one with a graphical user interface.
- Emails can be exported for a given date range, or new messages can be appended to an already existing export.
Many other options can be specified both at the commandline or through the graphical user interface.
TOPManual
The graphical user interface offers a convenient way for you to combine multiple mbox files into one xml export. The selections you make and options you set can be saved in a file, so they can be easily reloaded next time you use Mbox2xml.
The command line version can be found in your install directory under the name 'mbox2xml.exe'. This version can also use the configuration files created with the graphical version. Type 'mbox2xml -h' at the command prompt for further details.
The following table shows an overview of most the features and settings of Mbox2xml.
TOP| item | description |
|---|---|
| Title | The title to be shown at the top of your export pages. |
| Mboxes | This element show a list of the mbox files you've selected for export. |
| Manually select an mbox file to add to the list. | |
| Have Mbox2xml search a certain location on your hard drive for mbox files. | |
| Destination | The location on your hard drive to which Mbox2xml will write the exported messages. |
| Browse for a location to which to export your messages. | |
| Start Date & Time | Click on the select box and choose a date from the pop-up calendar. All messages sent or received on this date or later will be exported. To remove a selected date, uncheck the box next to it. Use the time box below if you also want to add a specific time. |
| Append messages | If you select this option and Mbox2xml finds a previous export at the given destination, then only messages newer than the newest in the old export will be exported and appended to the old export. |
| Overwrite | This option tells Mbox2xml to overwrite any older exports found at the export destination. |
| Ignore attachments | Mbox2xml will not export attachments when this options is selected. |
| Include stylesheet | If this option is selected, Mbox2xml will include extra files to the export which make it viewable in any modern webbrowser. |
| Include raw header | Use this option to force Mbox2xml to include ALL header information from your emails. This is generally not needed, but may suit advanced users. |
| Launch on complete | When this option is selected, the export will be opened in your default webbrowser after the export is complete. |
| Messages per page | If your export has more messages than indicated by this number, then Mbox2xml will devide them over multiple pages. This can be helpful when exporting large amounts of messages, since webbrowsers can have trouble rendering them in a single page. |
| Start Export | This button starts the export process. |
Changelog
- 2011-07-26: Fixed permission issues with Windows Vista/7 and some small bug fixes.
- 2010-02-23: Removed a bug that caused an error when ignoring attachments.
- 2010-01-01: A new command line option has been added (-c
), which allows Mbox2xml to scan for mbox files each time it's run. - 2009-12-07: Various bug fixes related to the extraction of attachments.
- 2009-11-21: Added horizontal scrolling to the Mbox list and fixed an error which could result in faulty XML.
- 2009-10-23: Quick filters added to the web pages, for easy searching.
- 2009-10-13: First release of Mbox2xml 2.0!
Feedback
We are very grateful for any feedback or suggestions you might give us. Thank you in advance!
Posts
unkown :: 2010-12-09 17:54:20
great tool, very useful ;)
however i found a bug when converting newer emails. Apparently the date is causing problems, here's the error output:
2010-12-09 18:15:37 - WARNING messagewriter(354): Could not parse date: year=10 is before 1900; the datetime strftime() methods require year >= 1900
2010-12-09 18:28:20 - ERROR controller(185):
Error while running export job!
Traceback (most recent call last):
File "controller.pyo", line 163, in run_job
File "indexer.pyo", line 86, in write_index
ValueError: year=10 is before 1900; the datetime strftime() methods require year >= 1900
2010-12-09 18:28:20 - INFO mbox2xmlgui(441): Export complete
Keith :: 2010-08-23 14:20:55
I like to automatically back up email as part of my daily shutdown procedure but have abandoned all hope of getting the mbox2xml command line to work without restart-forcing crashes. Here is the AutoIT script I now use in compiled form to get the same result.
Opt("WinDetectHiddenText", 1)
$MBOXTEMPLATE = "E:\test\Update-Email-Archive.m2x"
$PID = Run ( @ProgramFilesDir & "\mbox2xml\mbox2xmlgui.exe" & " " & '"' & $MBOXTEMPLATE & '"' )
WinWaitActive ( "Mbox2xml", "", 5)
ControlFocus("Mbox2xml", "", "[ID:143]")
ControlClick("Mbox2xml", "", "[ID:143]")
WinWaitActive ( "Mbox2xml", "Export complete")
If WinExists("Mbox2xml", "Export complete") Then
ProcessClose($PID)
EndIf
Elit :: 2010-06-15 07:05:49
Hi Nancy,
Email messages can be sent in 2 different ways: as plain text, without any layout information, or as HTML pages with a specific layout.
When sending an HTML formatted message, most email clients will include a plain text copy. In that case you will see the plain text in your export and an attachment along with it, which has the same text but with formatting.
When you don't see a message text but only an 'html' attachment, that's probably because the email client used to send the message did not include a plain text version.
nancy :: 2010-06-14 15:33:30
why on the xml web browser pages do some message text content show up in in a little frame so you can read it while others are show at "part-001.html" links?
nancy
Elit :: 2010-06-14 07:07:19
@Chuck
The AppData folder probably is 'hidden', so please make sure Windows show you hidden files. Also, you might be able to just type in C:\Users\<Username>\AppData into the Windows Explorer address bar.
Of course you could always change the location where Thunderbird stores your emails.
@Blair
Please remember that Chrome is officially still in beta phase :)
Blair :: 2010-05-30 09:22:45
Somehow after the update to Chrome 5 the "index.xml" is not displaying any content anymore. :(
TOPchuck :: 2010-04-30 16:17:55
I am using this great program on a Vista machine with no difficulties.
I bought a new computer with WIN 7 and am locked out of the Thunderbird folder with the email in it. Can you tell me how to set up Mbox2 so I can begin backing up my mail?
Thunderbird stores my mail at:
C:\Users\chuck\AppData\Roaming\Thunderbird\Profiles\20che1xs.default\Mail\pop.mail.yahoo.com
I cannot get to the the directory AppData It does not appear in Explorer (the folder "Chuck" is preceded with an icon of a padlock and the the first folder under Chuck is "Contacts"
Any help would be appreciated.
Elit :: 2010-04-01 05:12:37
I'm afraid there isn't a way to import your messages back into Thunderbird using Mbox2xml.
TOPworstcase :: 2010-04-01 01:58:51
Hello! Is it possible to import the *.xml-files back into Thunderbird? And if so - how to do that?
Thankyou!
Elit :: 2010-03-20 14:43:31
No there isn't. You are probably looking for 'index.xml'.
TOPunkown :: 2010-03-20 06:03:43
is there an index.html missing?
TOPElit :: 2010-03-14 22:39:10
Hi Marek,
thanks for the ideas, specially the one about standardized dates.
Marek :: 2010-03-14 20:22:01
I just stumbled upon Mbox2XML, and find it a wonderful tool! Congratulations! I have recovered a number of mailboxes from my previous mailer (Pegasus), and I can read the archives again.
My interest and potential use for this program would be to allow archiving old email in a format, that can be read another 30 years from now... (SMTP protocol was created in 1980). Paper letters 100 or 500 year old are still readable, old email hardly is. XML is a protocol that will definitely survive next 30 years, so far so good.
Some small comments / questions:
Would it be possible to use standard XML Schema date/time format (YYYY-MM-DDThh:mm:ss)?. Anything that conforms better to standards has slightly better chance of surviving.
Are you planning to save the address-book data as well? My 30 year archive would benefit from having stored the address books for people emailing 30 years ago?
Once again thanks for a great product
Marek
Thomas :: 2010-03-14 09:07:11
I separated the malicious mails and examined the mails with a test editor. They have all date data the following way: "Date: 22 May 09 16:36:04 CEST" - the century digits are droped. These mails are mainly machine generated mails like order confiramtions. I corrected the date data within the mbox files and now the conversion works without any error.
TOPElit :: 2010-03-10 13:56:39
@Thomas:
Thanks for the report. It looks like some date data is badly/oddly formatted. You could help by opening the offending mbox file in a text editor (like Windows' Notepad), find the offending message and send at least the header information to us. Please make sure to remove any private information prior to sending.
In any case, we will adjust Mbox2xml soon to handle the error more gracefully, so the export will continue as it should, although some date(s) might be lost for the time being.
@John Ferris:
Thank you very much for the compliment!
John Ferris :: 2010-03-09 06:09:03
The improved style sheet is superb. Prior to using Mbox, I used to backup all my profiles onto a CD.
To find old emails meant having to have a seprate version of Thunderbird installed to recover the profiles.
Mbox style sheet lists all my emails back as far as january 2007 in lists of 50. (14570 emails altogether).
This has allowed me to clean out my mboxes and I now simply do an incremental backup once a month.
Totally satisfied with the simplicity of the program.
John Ferris
Muswellbrook NSW
Australia
Thomas :: 2010-03-07 17:38:18
Hello,
first thanks for the great tool! First I try'd to convert a relative small mbox file (mbox file with about 40 mails) to xml file and the reasult looked very pretty. So I thought of converting my mail archives containing about 270 mails in xml format but the tool failed with the error message:
Error while running export job!
Traceback (most recent call last):
File "controller.pyo", line 163, in run_job
File "indexer.pyo", line 86, in write_index
ValueError: year=9 is before 1900; the datetime strftime() methods require year >= 1900
18:14 - Export complete
There are a lot "message_xxx.xml" files generated but no "index.xml" when I export my mails of a whole year. I can reproduce the error with several other mail folders, so if you would like to have some more information please tell me.
I hope you can find the problem with the error message above.
Thanks a lot!
Elit :: 2010-02-26 13:54:42
Don't worry Dave, your and any feedback is greatly appreciated!
We're planning on taking Mbox2xml beyond the scope of Thunderbird and the mbox format, so it will support Outlook's pst format. When this will happen, we can not yet say.
Dave Ashton :: 2010-02-26 08:59:47
Oops. The message board has stripped the whitspaces I was trying to demonstrate
TOPDave Ashton :: 2010-02-26 08:58:24
Sorry I have a problems with the format of my headers. The general rule (from RFC822) is that wherever there may be linear-white-space (NOT simply LWSP-chars), a CRLF immediately followed by AT LEAST one LWSP-char may instead be inserted.Thus, the single line
To: "Joe & J. Harvey" <ddd @Org>, JJV @ BBN
can be represented as:
To: "Joe & J. Harvey" <ddd @ Org>,
JJV@BBN
and
To: "Joe & J. Harvey"
<ddd@ Org>, JJV
@BBN
and
To: "Joe &
J. Harvey" <ddd @ Org>, JJV @ BBN
I have checked the raw text mailbox files that I have produced by using readpst. There is not always a space following the carriage return in the folded lines. My mistake sorry!!
Once a field has been unfolded, it may be viewed as being composed of a field-name followed by a colon (":"), followed by a field-body, and terminated by a carriage-return/line-feed.
The field-name must be composed of printable ASCII characters (i.e., characters that have values between 33. and 126., decimal, except colon). Space is ASCII 32. So the Microsoft ones are non-conformant. Again my mistake.
Elit :: 2010-02-23 13:38:49
Hi Dave,
I'm not exactly sure what's going on there, but I suspect your mbox file is violating specifications.
We will look into it soon, to see if we can make Mbox2xml more robust.
Dave :: 2010-02-21 14:13:52
This program has some problems. Version downloaded and installed 21/2/2010 16:00 GMT+2:00
1)If the mail header includes
Microsoft Mail Internet Headers Version 2.0: Microsoft Mail Internet Headers Version
2.0
It stops reading the headers and puts the remainder of the mail into the body. Missing out subject to's, from's cc's etc
2) If a header field extends beyond one line e.g. if the mail has a long subject or a long list of to's the remainder of the header is not parsed correctly but put into the mail body.
Elit :: 2010-02-20 12:52:51
Hmmm, interesting suggestion.
It would be quite a bit of work, but who knows.
Numron :: 2010-02-18 12:59:44
Hi, thanks for the tool!
There is only 1 enhancement missing. If an export includes subfolders, the tree-structure will be lost after exporting. Instead of having a flat list of emails, I would rather geht a list with expandable subtrees.
Do you think this is something we will see in the futur?
Thanks
Numron
© 2000 - 2013 - last update: