AutoSketch

July 10, 2026 by Thor

There were a lot of CAD programs from the 1980s through the 1990s. Some titles include TurboCAD, DesignCAD, Generic CADD, KeyCAD, SwiftCAD, CADKEY, VersaCAD, MiniCAD, EasyCAD, and of course AutoCAD. There were also many file formats. Some programs adopted the common DXF exchange format, but most chose their own way of doing things.

AutoCAD is made by AutoDesk, but it is only one of many software tools the company has made. AutoCAD has its own native file format, DWG, which is well documented, but a little brother to AutoCAD was marketed for a few years, and it also had its own format(s).

AutoSketch was a less powerful version of AutoCAD and only supported 2D models, but it still found a good following of users. Unlike its big brother, the AutoSketch native format(s) don’t have much documentation and are not supported in any AutoDesk product today. The PRONOM entry for SKF (fmt/306) is missing a signature; it is an outline entry only. In addition to SKF, AutoSketch had an earlier format that needs to be documented and a signature created. Let’s take a look.

The first version of AutoSketch came out in 1986 for DOS. Drawings were held in RAM and could be written out as an SKD file. Looking at a saved file, we see a format that is very easy to identify.

% hexdump -C TEMPLATE.SKD | head
00000000  41 75 74 6f 53 6b 65 74  63 68 20 64 72 61 77 69  |AutoSketch drawi|
00000010  6e 67 20 64 61 74 61 62  61 73 65 0d 0a 1a 00 00  |ng database.....|
00000020  49 03 00 44 00 00 00 01  00 00 00 00 00 00 00 00  |I..D............|
00000030  00 00 00 00 00 40 41 00  00 10 41 00 00 00 00 00  |.....@A...A.....|
00000040  00 00 00 00 00 40 41 00  00 10 41 00 00 00 3f 01  |.....@A...A...?.|
00000050  00 09 00 00 00 00 00 00  00 00 00 00 00 00 00 33  |...............3|
00000060  33 33 33 33 33 d3 3f 00  00 00 00 00 00 f0 3f 00  |33333??......??.|
00000070  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000080  00 80 3f 00 00 80 3f 0a  00 08 00 00 00 00 00 ff  |..?...?........?|
00000090  ff 00 00 00 00 00 00 00  00 00 00 00 00 00 00 25  |?..............%|

Most formats don’t make it this easy. Having the ASCII string “AutoSketch drawing database” makes identification very easy. Let’s see if this pattern stays true with the next version, version 2.0 for DOS.

% hexdump -C ENGINE.SKD | head
00000000  41 75 74 6f 53 6b 65 74  63 68 20 64 72 61 77 69  |AutoSketch drawi|
00000010  6e 67 20 64 61 74 61 62  61 73 65 0d 0a 1a 00 fd  |ng database....?|
00000020  49 06 00 12 04 00 00 01  00 00 00 64 20 88 40 f7  |I..........d .@?|
00000030  eb 38 c2 88 fe 1b 44 25  83 dd 43 00 00 00 00 00  |?8?.?.D%.?C.....|
00000040  00 00 00 00 80 1d 44 00  80 d9 43 00 00 00 41 01  |......D..?C...A.|
00000050  00 07 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000060  00 00 00 00 00 14 40 00  00 00 00 00 00 f0 3f 00  |......@......??.|
00000070  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000080  00 40 41 00 00 40 41 0a  00 08 00 00 00 00 00 1f  |.@A..@A.........|
00000090  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 1d  |................|

AutoDesk also released version 3.0 of AutoSketch in 1990.

Version 3.0 of AutoSketch produced files which appear very similar, but they need a closer look.

% hexdump -C ASSY.SKD | head
00000000  41 75 74 6f 53 6b 65 74  63 68 20 64 72 61 77 69  |AutoSketch drawi|
00000010  6e 67 20 64 61 74 61 62  61 73 65 0d 0a 1a 00 8b  |ng database.....|
00000020  49 09 00 db 00 00 00 01  00 00 00 00 00 00 00 00  |I..?............|
00000030  00 00 00 b7 cc a4 41 79  99 79 41 00 00 00 00 00  |...???Ay.yA.....|
00000040  00 00 00 00 00 04 42 00  00 90 41 00 00 00 3f 01  |......B...A...?.|
00000050  00 07 00 00 00 00 00 00  00 00 00 00 00 00 00 9a  |................|
00000060  99 99 99 99 99 c9 3f 00  00 00 00 00 00 f0 3f 00  |.....??......??.|
00000070  00 00 00 00 00 00 00 00  00 00 00 01 00 00 00 00  |................|
00000080  00 80 3f 00 00 80 3f 0a  00 08 00 00 00 00 00 ff  |..?...?........?|
00000090  00 00 00 00 00 00 00 00  00 33 33 33 33 33 f3 1e  |.........33333?.|
*
000001b0  00 00 00 00 00 f0 3f 01  00 01 00 00 00 00 00 00  |.....??.........|
000001c0  00 00 00 00 00 00 00 74  78 74 00 f5 eb 0e 05 08  |.......txt.??...|
000001d0  04 15 01 08 00 eb 02 62  c0 76 32 00 fe fe fe fe  |.....?.b?v2.????|
000001e0  fe fe fe fe fe fe fe fe  fe 55 55 55 55 55 55 55  |?????????UUUUUUU|
000001f0  55 57 77 55 55 55 55 55  55 55 55 55 55 55 55 55  |UWwUUUUUUUUUUUUU|
00000200  55 55 55 55 55 55 35 55  53 33 55 00 00 00 00 00  |UUUUUU5US3U.....|
00000210  00 50 00 00 00 07 77 77  77 77 77 77 77 77 77 00  |.P....wwwwwwwww.|

It turns out AutoDesk made a change to the SKD format, and you can see the version marker “v2” at offset 473 (0x1D9). Later versions of AutoSketch require this version of SKD in order to open a file. A special version of AutoSketch, 2.1fc, was released with limited features to allow users to open earlier versions of the SKD format and convert them to the second version of the format. The readme file for the 2.1fc version has this statement:

This limited release of AutoSketch has been provided for file conversion purposes only. All functionality other than opening and saving files has been removed from this release.

Please follow the steps below to perform a drawing file conversion for files created in AutoSketch Release 2.0 for Windows, or earlier. Once drawing files have been converted, you will be able to open them in AutoSketch Release 5.0 and later.

AutoDesk released the first version of AutoSketch for Windows in 1992. This first version for Windows used the new SKD format as well. Even though it was version 1.0 for Windows, the new version of the format is known as the v2.x format.

% hexdump -C AS1-S01.SKD
00000000  41 75 74 6f 53 6b 65 74  63 68 20 64 72 61 77 69  |AutoSketch drawi|
00000010  6e 67 20 64 61 74 61 62  61 73 65 0d 0a 1a 00 00  |ng database.....|
00000020  49 0b 00 00 00 00 00 01  00 00 00 00 00 00 00 00  |I...............|
00000030  00 00 00 00 00 40 41 00  00 10 41 00 00 00 00 00  |.....@A...A.....|
00000040  00 00 00 00 00 40 41 00  00 10 41 00 00 80 3f 01  |.....@A...A...?.|
00000050  00 07 00 00 00 00 00 00  00 00 00 00 00 00 00 33  |...............3|
00000060  33 33 33 33 33 d3 3f 00  00 00 00 00 00 f0 3f 00  |33333??......??.|
00000070  00 00 00 00 00 00 00 00  00 00 00 01 00 00 00 00  |................|
00000080  00 80 3f 00 00 80 3f 0a  00 08 00 00 00 00 00 ff  |..?...?........?|
00000090  ff 00 00 00 00 00 00 00  00 9a 99 99 99 99 99 1f  |?...............|
000000a0  40 cd cc cc cc cc cc 23  40 00 00 01 00 00 00 00  |@??????#@.......|
000000b0  00 00 00 00 00 f0 3f 00  00 00 00 00 00 f0 3f 00  |.....??......??.|
000000c0  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
000001d0  00 00 00 00 00 00 00 e8  9a 76 32 00 00 00 00 00  |.......?.v2.....|
000001e0  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
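
As a rough illustration of the two SKD variants seen so far, here is a minimal Python sketch. It assumes the 31-byte header (the ASCII string plus 0D 0A 1A 00) is identical in every SKD file, and that the bytes “v2” at offset 0x1D9 are enough to tell the second version of the format from the first. Both assumptions rest only on the handful of samples shown here, and the function name is made up for this example.

```python
from typing import Optional

# Header bytes common to every SKD sample shown above (assumed constant).
SKD_MAGIC = b"AutoSketch drawing database\r\n\x1a\x00"

# Offset where the later samples carry the ASCII marker "v2".
V2_OFFSET = 0x1D9


def skd_generation(data: bytes) -> Optional[str]:
    """Guess which SKD variant a file is, based only on the samples above.

    Returns "first", "second", or None when the header does not match.
    """
    if not data.startswith(SKD_MAGIC):
        return None
    if data[V2_OFFSET:V2_OFFSET + 2] == b"v2":
        return "second"
    return "first"
```

To try it on a real file, read at least the first 475 bytes, for example `skd_generation(open("ASSY.SKD", "rb").read(512))`.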

This is the format used through version 2.1 for Windows which was released in 1996. It was after this version AutoDesk made a major change to the software. In version 5, skipping 3 and 4, an entirely new drawing engine was used, changing the software and the file format. This new format is actually the format used by the software the new drawing engine came from, DrafixCAD. AutoDesk purchased the software company Softdesk Drafix and used the DrafixCAD software, originally developed by Foresight Resources Corp., for this new version of AutoSketch.

Let’s compare a couple of DrafixCAD files to an AutoSketch 5 and 6 file.

% hexdump -C DEMOPLAN.CAD | head
00000000  0a 75 00 00 00 0b 02 38  30 13 05 46 4c 4f 41 54  |.u.....80..FLOAT|
00000010  14 05 53 48 4f 52 54 0c  04 30 2e 38 30 0d 08 4f  |..SHORT..0.80..O|
00000020  63 74 20 31 39 38 39 0e  12 44 72 61 66 69 78 20  |ct 1989..Drafix |
00000030  57 69 6e 64 6f 77 73 20  43 41 44 0f 06 49 42 4d  |Windows CAD..IBM|
00000040  20 50 43 10 07 44 4f 53  20 33 2e 30 11 06 4d 53  | PC..DOS 3.0..MS|
00000050  20 32 2e 31 12 1e 31 39  38 39 20 46 6f 72 65 73  | 2.1..1989 Fores|
00000060  69 67 68 74 20 52 65 73  6f 75 72 63 65 73 20 43  |ight Resources C|
00000070  6f 72 70 2e 00 28 43 03  00 00 29 b6 01 00 00 01  |orp..(C...)?....|
00000080  07 00 02 01 00 04 01 00  05 01 00 06 01 00 07 00  |................|
00000090  00 80 3f 08 00 00 09 0d  00 0a 01 00 0c 04 00 11  |..?.............|

% hexdump -C DEMOPLAN.CAD | head
00000000  0a 7d 00 00 00 0b 03 32  30 30 13 06 44 4f 55 42  |.}.....200..DOUB|
00000010  4c 45 14 05 53 48 4f 52  54 0c 03 32 2e 30 0d 0f  |LE..SHORT..2.0..|
00000020  4a 61 6e 75 61 72 79 20  37 2c 20 31 39 39 32 0e  |January 7, 1992.|
00000030  12 44 72 61 66 69 78 20  57 69 6e 64 6f 77 73 20  |.Drafix Windows |
00000040  43 41 44 0f 06 49 42 4d  20 50 43 10 07 44 4f 53  |CAD..IBM PC..DOS|
00000050  20 33 2e 30 11 06 4d 53  20 32 2e 31 12 1e 31 39  | 3.0..MS 2.1..19|
00000060  38 39 20 46 6f 72 65 73  69 67 68 74 20 52 65 73  |89 Foresight Res|
00000070  6f 75 72 63 65 73 20 43  6f 72 70 2e 00 28 28 04  |ources Corp..((.|
00000080  00 00 29 80 02 00 00 04  01 00 05 01 00 07 00 00  |..).............|
00000090  00 00 00 00 00 40 08 01  00 0a 01 00 0c 04 00 11  |.....@..........|

% hexdump -C ASv5-s01.SKF | head
00000000  0a 00 00 00 00 0b 03 34  30 31 01 00 00 00 38 00  |.......401....8.|
00000010  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|

% hexdump -C ASv6-s01.SKF | head
00000000  0a 00 00 00 00 0b 03 34  30 32 03 36 2e 30 01 20  |.......402.6.0. |
00000010  0f 4f 63 74 6f 62 65 72  20 32 2c 20 31 39 39 38  |.October 2, 1998|
00000020  01 00 00 00 38 00 00 00  00 00 00 00 00 00 00 00  |....8...........|
00000030  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|

You can see a similar pattern in the first 8 bytes, with differences mainly in the 2nd and 7th bytes. The 8th byte (0x34) is consistent across all SKF files, and the 9th and 10th bytes turn out to increase in sequence with the version.

Looking at the next few versions of AutoSketch we see:

% hexdump -C AS7-s01.SKF | head
00000000  0a 00 00 00 00 0b 03 34  31 32 03 37 2e 30 08 42  |.......412.7.0.B|
00000010  75 69 6c 64 20 35 34 10  4f 63 74 6f 62 65 72 20  |uild 54.October |
00000020  31 35 2c 20 31 39 39 39  01 00 00 00 38 00 00 00  |15, 1999....8...|
00000030  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|

% hexdump -C AutoSketch8-s01.SKF | head
00000000  0a 00 00 00 00 0b 03 34  31 32 03 39 2e 30 08 42  |.......412.9.0.B|
00000010  75 69 6c 64 20 38 38 0a  30 37 2f 30 38 2f 32 30  |uild 88.07/08/20|
00000020  30 34 01 00 00 00 38 00  00 00 00 00 00 00 00 00  |04....8.........|
00000030  00 40 00 00 00 00 00 00  1f 40 00 00 00 00 00 00  |.@.......@......|
00000040  00 00 00 00 00 00 00 00  00 40 00 00 00 00 00 00  |.........@......|
00000050  20 40 00 00 00 00 00 00  00 00 00 00 00 00 00 00  | @..............|
00000060  00 00 02 00 00 00 76 5e  00 00 42 4d 76 5e 00 00  |......v^..BMv^..|
00000070  00 00 00 00 76 00 00 00  28 00 00 00 00 01 00 00  |....v...(.......|
00000080  bc 00 00 00 01 00 04 00  00 00 00 00 00 5e 00 00  |?............^..|
00000090  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|

% hexdump -C AutoSketch9-s01.SKF | head
00000000  0a 00 00 00 00 0b 03 34  31 33 03 39 2e 30 08 42  |.......413.9.0.B|
00000010  75 69 6c 64 20 38 38 0a  30 37 2f 30 38 2f 32 30  |uild 88.07/08/20|
00000020  30 34 01 00 00 00 38 00  00 00 00 00 00 00 00 00  |04....8.........|
00000030  00 40 00 00 00 00 00 00  1f 40 00 00 00 00 00 00  |.@.......@......|
00000040  00 00 00 00 00 00 00 00  00 40 00 00 00 00 00 00  |.........@......|
00000050  20 40 00 00 00 00 00 00  00 00 00 00 00 00 00 00  | @..............|
00000060  00 00 02 00 00 00 76 5e  00 00 42 4d 76 5e 00 00  |......v^..BMv^..|
00000070  00 00 00 00 76 00 00 00  28 00 00 00 00 01 00 00  |....v...(.......|
00000080  bc 00 00 00 01 00 04 00  00 00 00 00 00 5e 00 00  |?............^..|
00000090  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|

% hexdump -C AS10-s01.SKF | head
00000000  0a 00 00 00 00 0b 03 34  31 33 04 31 30 2e 30 08  |.......413.10.0.|
00000010  42 75 69 6c 64 20 31 31  0a 31 32 2f 30 32 2f 32  |Build 11.12/02/2|
00000020  30 30 38 01 00 00 00 38  00 00 00 00 00 00 00 00  |008....8........|
00000030  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000040  00 00 00 25 91 48 24 12  89 30 40 5d 2c 16 8b c5  |...%.H$..0@],..?|
00000050  62 27 40 00 00 00 00 00  00 00 00 00 00 00 00 86  |b'@.............|
00000060  01 36 00 02 00 00 00 76  5e 00 00 42 4d 76 5e 00  |.6.....v^..BMv^.|
00000070  00 00 00 00 00 76 00 00  00 28 00 00 00 00 01 00  |.....v...(......|
00000080  00 bc 00 00 00 01 00 04  00 00 00 00 00 00 5e 00  |.?............^.|
00000090  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|

Version 10, released in 2008, was the final version of AutoSketch. It appears the SKF files from versions 7 and 8 share the same format, and 9 and 10 share one as well. This seems to be confirmed by the Save As dialog in version 9, which lumps versions 7 and 8 together.

So, if we wanted to make one signature for all SKF files, it might look something like:

0A000000000B0334(30|31)(31|32|33)

Or we can choose to split out versions if needed. For now, adding this signature to the existing PRONOM entry for SKF would be helpful. As for the earlier SKD format, we clearly need two signatures, one for the first version and one for the second.
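
To show how the combined SKF signature above behaves, here is a small Python sketch that expresses it as a regular expression anchored at the start of the file. The function name is hypothetical, and the pattern is deliberately as loose as the signature itself, so it would also accept combinations such as “403” or “411” that do not appear in any of my samples.

```python
import re
from typing import Optional

# 0A000000000B0334(30|31)(31|32|33), written as a bytes pattern.
SKF_SIGNATURE = re.compile(
    rb"\x0a\x00\x00\x00\x00\x0b\x03\x34[\x30\x31][\x31\x32\x33]"
)


def skf_format_code(data: bytes) -> Optional[str]:
    """Return the three-digit code (e.g. "401", "413") or None if no match."""
    if SKF_SIGNATURE.match(data) is None:
        return None
    # Bytes 8 to 10 (offsets 7..9) hold the ASCII digits of the code.
    return data[7:10].decode("ascii")
```

In the samples shown here the codes are 401 (version 5), 402 (version 6), 412 (versions 7 and 8) and 413 (versions 9 and 10), so splitting out versions later would only mean mapping the returned code.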

Here is a small timeline of the different versions of AutoSketch.

AutoSketch Release 1.0 DOS 1986-10
AutoSketch Release 1.01 DOS 1987-02
AutoSketch Release 1.02 DOS 1987-09
AutoSketch Release 1.03 DOS 1988-02
AutoSketch Release 2.0 DOS 1989
AutoSketch Release 3.0 DOS 1990
AutoSketch Release 3.1 DOS 1991
AutoSketch Release 1.0 Windows 1992
AutoSketch Release 2.0 Windows 1994
AutoSketch Release 2.1 Windows 1996
AutoSketch Release 5 Windows 1997-11
AutoSketch Release 6 Windows 1998-10
AutoSketch Release 7 Windows 1999
AutoSketch Release 8 Windows 2001
AutoSketch Release 9 Windows 2004-09-20
AutoSketch Release 10 Windows 2008-11

I need to do a little more research on the DrafixCAD format, and I will update the signature when I do. As always, take a look at my samples and suggested signatures on my GitHub page.

Sleeves of Tyvek

May 25, 2026 by Thor

In honor of the 25th of May, aka #525FloppyDay, I have a few thoughts on 5.25″ Floppy Disks, more specifically, the sleeves that protect them.

This year marks the 50th anniversary of the 5.25″ Floppy Disk. Prior to 1976, Floppy Disks were a massive 8 inches square, and they also used sleeves. It wasn’t until the smaller “mini-diskette” arrived in 1976 that the media became more popular, going on to be used in most desktop computer systems in the 1980s. I won’t get into all the different ways data was encoded on these diskettes, as that is a massive topic, so today we will look at an often overlooked feature of the media: the protective sleeve.

For the sake of consistency, I will call this protection a sleeve, but it may also be referred to as an envelope, jacket, or pouch, with pouch being used by the folks down under. The sleeve was mainly used for storage of the disk, but it was also a great marketing tool and, on the back side, a reminder of the proper care of the disk. Floppy disk manufacturers and software companies would use the sleeve to print their logo and other important information about the disk, with some opting for blank or generic graphics. Today we will look at some of the creative and not so creative uses.

Sleeves for floppy disks were made of a few different types of material. Plain paper sleeves were very common, many of which have yellowed over time. Some were very thin, others quite thick to offer more protection. One type of material often used for a sleeve is called Olefin. Kind of a cross between paper and plastic, this spunbonded material was used by many manufacturers because of its superior durability and its ability to repel water. This material, made by DuPont, was named Tyvek and has many uses, one being to protect the walls of your home.

As far as I know there is no “archive” documenting all floppy disk sleeves. There is a website called the Original Disk Sleeve Archive, which was started years ago and has many sleeves referenced, but it hasn’t been updated in a while. Jason Scott at the Internet Archive has also uploaded scans he has made, here and here.

Today I would like to share my collection of Sleeves with you. I scanned them myself from my collection at a massive 1200 dpi so you can explore all the details and find creative uses for them.

You will find most of the sleeves have a logo or name on the front, but the back may have some interesting icons. These icons helped remind us of the fragile nature of floppy disks and the care needed to make them last. The icons can be understood without any explanation, using familiar images. Some have the typical circle with a line through it, indicating NOT to do something, while others use simple words in a few languages to get the meaning across. Then you have some who decided to go with humor or clever phrases to get the message across. In all, the messages were clear and understood, but as the floppy disk lost its usefulness over time, the icons and messages were lost to time as well.

You might notice many of the different sleeves in my collection share a common set of icons, which many companies probably bought instead of designing their own. Others put some time and effort into designing their own.

Generally they all carry the same message: do not bend, only use soft-tip pens or markers, never use paper clips, don’t touch the magnetic media with your hands, and most of all, keep magnets away from the disk. Others remind you to insert the disk into the drive carefully and not to force it. Many list the proper temperature and humidity for storing the disks, which is a very wide range.

The sources of magnetism were very different from what we might find today in a typical office or home. No longer do we see large heavy telephones with huge magnets inside, or huge CRT monitors, nor do many of us keep large multimedia speakers at our desks, but these were real dangers when floppy disks were in use. For those who use floppy disks today, we have new unknown dangers to contend with: the modern laptop and cell phone, which can be deadly to magnetic media. I explore this and other myths in a paper submitted to iPres this fall, stay tuned!

One of my favorite practical back sides is this one from Elephant Memory Systems. Don’t touch the shiny Parts! Also a reminder of making a copy of your data if it’s important, a good preservation principle.

There are some slightly humorous instructions, like not letting your disk drink or smoke.

Or this clever set of instructions. Letting us know to avoid letting our disks die an agonizing death and to keep them happy.

But of course the most famous of all disk care back side instructions belong to the Beagle Bros. They decided to go with something less practical and more humorous to get their point across.

The Beagle Bros were a software company and distributor who mainly distributed Apple II software in the 1980s. The art is credited to Fred Crone, an artist for the Beagle Bros, who with his wife Sara created much of the art used by the software company. The warnings were so popular that the company was frequently asked if they could be reprinted, and permission was happily given. You can find more images and history at the Beagle Bros Repository. Which is your favorite?

If you happen to catch me at a conference or send me a nice message below, I might part with a few stickers I have made using many of the icons and warnings you find on the back of these sleeves. Enjoy.

Discus

May 22, 2026 by Thor

I found myself in the same situation again, with a colleague asking me for help identifying an unknown file. The file in question did not have an extension and, to make things harder, the file could not be shared with me, only the header. So, not having much to go on, I started with some assumptions. Not having an extension leans toward the file being from an early Macintosh system. My favorite. I asked some follow-up questions and learned the file was from around 2002, but it no longer had any extended attributes that might contain a Type/Creator code to help with identification. I was also told there was mention of layers and fonts later in the file.

Looking at the header sent to me, there was an obvious first choice to look into.

cd cd 20 07 43 41 4e 56 (ÍÍ CANV)

With the ASCII text “CANV”, the initial thought was this file was related to the popular Canvas software. I did a post on the Canvas formats a couple of years ago, so I went back through my files, but could not find any match. Most uncompressed Canvas files use “CANVAS5” or “CANVAS6”, nothing shortened to CANV. I checked all the samples I have from many versions. Back to square one.

I looked at other graphics and desktop publishing programs from the time period, and I even asked AI to help me narrow it down. AI recommended Canvas as well, along with many others I was already aware of and had dismissed. AI did not know what to make of the CANV string in the header, so it was not too helpful. I did follow a couple of leads it gave me, but they led to dead ends. One title I looked into was called Desktop Publisher Pro. Files from this software also had a unique header, but nothing resembling what I was looking for. Maybe I will do another post on this software in the future.

So the next step in my process was to scan through all my sample data sets for something that matched or was close. I made a simple signature with the CANV header, but also one with the “CDCD” hex values, as it seemed unique as well. I set the scan to run overnight on my sample set, and the next morning I was met with disappointment. Not one match. I decided to run the same signatures across a few of my other drives of personal files, just for good measure. The next morning I had a surprise. Within the tens of thousands of files scanned, one file popped. The file the scan found also had no extension, and the software that created it was no longer on my computer, so the file was not associated with any software. But luckily in my case, the Type/Creator code was still attached!
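
For anyone wanting to repeat this kind of overnight scan without a dedicated identification tool, here is a minimal Python sketch of the idea. It is not the tool or the signature files I actually used; the function names are made up, and the two checks (CANV at offset 4, or CD CD at offset 0) simply mirror the two quick signatures described above.

```python
import os
from typing import Iterator, Optional, Tuple


def classify_header(head: bytes) -> Optional[str]:
    """Return which of the two trial signatures the first 8 bytes match."""
    if head[4:8] == b"CANV":
        return "CANV"
    if head[:2] == b"\xcd\xcd":
        return "CDCD"
    return None


def scan_tree(root: str) -> Iterator[Tuple[str, str]]:
    """Walk a directory tree and yield (path, matched signature) pairs."""
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                with open(path, "rb") as fh:
                    head = fh.read(8)
            except OSError:
                continue  # unreadable file, skip it
            label = classify_header(head)
            if label is not None:
                yield path, label
```

Only the first 8 bytes of each file are read, which keeps a scan of tens of thousands of files reasonably quick.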

% getfileinfo "MC Wrap" 
file: "Martins Cove Wrap"
type: "DISC"
creator: "ARTD"
attributes: avbstclinmedz
created: 09/06/2005 13:52:35
modified: 09/06/2005 13:52:35

% hexdump -C "MC Wrap" | head
00000000  cd cd 00 07 43 41 4e 56  00 00 00 01 00 00 00 12  |??..CANV........|
00000010  00 03 00 01 00 05 02 20  00 16 00 00 00 00 00 09  |....... ........|
00000020  00 32 50 52 45 56 00 00  00 01 00 00 34 6e 00 02  |.2PREV......4n..|
00000030  00 00 00 00 00 00 00 43  00 64 00 00 00 00 00 43  |.......C.d.....C|
00000040  00 64 00 10 42 10 42 10  42 10 42 10 42 10 42 10  |.d..B.B.B.B.B.B.|
00000050  42 10 42 10 42 10 42 10  42 10 42 10 42 10 42 10  |B.B.B.B.B.B.B.B.|
*
00000100  42 10 42 10 42 10 42 10  42 10 42 10 42 10 00 21  |B.B.B.B.B.B.B..!|
00000110  00 21 00 21 00 21 00 21  00 21 00 21 00 21 00 21  |.!.!.!.!.!.!.!.!|

There it is! The header with CANV. Looking up the Creator code “ARTD” in my TCDB tool, I learned the creator of the file.

% python3 TC-lookup-draft-uni.py "MC Wrap" 
Type Code: DISC
Creator Code: ARTD
Size of Data Fork: 18483930 bytes
Size of Resource Fork: 0 bytes
Rows with Type Code b'DISC' and Creator Code b'ARTD': 
Row index: 11828
File Name: Discus—Disc Lable File
Type: DISC
Creator: ARTD
Comments: Discus
Category: nan
Extension: nan
Dup:  
Data by Ilan Szekely, Jerusalem: nan
------------------------------

The database of Type/Creator codes identified the file as being created by the Discus software. After a little searching, I remembered the file in question was created by Discus RE, which was included with Roxio Toast, CD/DVD burning software popular on the Macintosh for many years. Discus RE was labeling software bundled with Toast for a few versions, the last being Toast version 7.

I happen to have Toast 6 on my older PowerMac G5 machine which included Discus RE 2.74. I made a few samples to compare.

% hexdump -C DiscusRE2.6-s01 | head
00000000  cd cd 00 07 43 41 4e 56  00 00 00 01 00 00 00 12  |??..CANV........|
00000010  00 03 00 01 00 00 00 0a  00 01 00 00 00 00 00 00  |................|
00000020  00 32 50 52 45 56 00 00  00 01 00 00 4e 2e 00 01  |.2PREV......N...|
00000030  00 00 00 00 00 00 00 64  00 64 00 10 52 94 52 94  |.......d.d..R.R.|
00000040  56 b5 52 94 52 94 56 b5  52 94 52 94 56 b5 52 94  |V?R.R.V?R.R.V?R.|
00000050  52 94 56 b5 52 94 52 94  56 b5 52 94 52 94 56 b5  |R.V?R.R.V?R.R.V?|
00000060  52 94 52 94 56 b5 52 94  52 94 56 b5 52 94 52 94  |R.R.V?R.R.V?R.R.|
00000070  56 b5 52 94 52 94 56 b5  52 94 52 94 56 b5 52 94  |V?R.R.V?R.R.V?R.|
00000080  52 94 56 b5 52 94 52 94  56 b5 52 94 56 b5 5a d6  |R.V?R.R.V?R.V?Z?|
00000090  63 18 42 10 42 10 42 10  42 10 42 10 42 10 42 10  |c.B.B.B.B.B.B.B.|

You may have noticed both my file and this sample have a slightly different header from the one sent to me: my samples have “CDCD0007” while the header sent to me has “CDCD2007”. I am not sure what this slight difference means. I need to see more samples. Since this was a sample from Discus version 2, I tracked down samples from version 3 and 4.

% hexdump -C DiscusRE3-s01 | head
00000000  cd cd 00 07 43 41 4e 56  00 00 00 01 00 00 00 12  |??..CANV........|
00000010  00 03 00 01 00 00 00 0a  00 01 00 00 00 00 00 00  |................|
00000020  00 32 50 52 45 56 00 00  00 01 00 00 4e 36 00 02  |.2PREV......N6..|
00000030  00 00 00 00 00 00 00 64  00 64 00 00 00 00 00 64  |.......d.d.....d|
00000040  00 64 00 10 52 94 52 94  56 b5 52 94 52 94 56 b5  |.d..R.R.V?R.R.V?|
00000050  52 94 52 94 56 b5 52 94  52 94 56 b5 52 94 52 94  |R.R.V?R.R.V?R.R.|
00000060  56 b5 52 94 52 94 56 b5  52 94 52 94 56 b5 52 94  |V?R.R.V?R.R.V?R.|
00000070  52 94 56 b5 52 94 52 94  56 b5 52 94 52 94 56 b5  |R.V?R.R.V?R.R.V?|
00000080  52 94 52 94 56 b5 52 94  52 94 56 b5 52 94 52 94  |R.R.V?R.R.V?R.R.|
00000090  56 b5 52 94 56 b5 5e f7  67 39 42 10 42 10 42 10  |V?R.V?^?g9B.B.B.|

Version 3 seems to have the same structure. Version 4 is a little different and also has an extension this time. The first two bytes are the same, but they are followed by 0008 instead of 0007, and the CANV is missing, replaced by PREV.

% hexdump -C Discus4-s01.CD4 | head
00000000  cd cd 00 08 50 52 45 56  00 00 00 01 00 01 8e 36  |??..PREV.......6|
00000010  00 02 00 00 00 04 00 00  00 ba 00 8c 00 04 00 00  |.........?......|
00000020  00 ba 00 8c 00 20 32 32  32 ff 32 32 32 ff 32 32  |.?... 222?222?22|
00000030  32 ff 32 32 32 ff 32 32  32 ff 32 32 32 ff 32 32  |2?222?222?222?22|

I learned the Discus software was created by a software company called Magic Mouse Productions. Discus was written by Edward de Jong, the founder of Magic Mouse Productions, who has also created many other software titles. Discus sold millions of copies, many of which were bundled, but there were also full versions which unlocked many other label templates and a lot more artwork. The Discus software also included a large selection of artwork which could be used in the label design. Looking at a sample from the full version, I was hoping to see the CDCD2007, but found it to be similar to the other samples.

% hexdump -C Discus3.08-s01 | head
00000000  cd cd 00 07 43 41 4e 56  00 00 00 01 00 00 00 12  |??..CANV........|
00000010  00 03 00 01 00 00 01 f4  00 11 00 00 00 00 00 00  |.......?........|
00000020  00 32 50 52 45 56 00 00  00 01 00 00 4e 36 00 02  |.2PREV......N6..|
00000030  00 00 00 00 00 00 00 64  00 64 00 00 00 00 00 64  |.......d.d.....d|
00000040  00 64 00 10 52 94 52 94  56 b5 52 94 52 94 56 b5  |.d..R.R.V?R.R.V?|
00000050  52 94 52 94 56 b5 52 94  52 94 56 b5 52 94 52 94  |R.R.V?R.R.V?R.R.|
00000060  56 b5 52 94 52 94 56 b5  52 94 52 94 56 b5 52 94  |V?R.R.V?R.R.V?R.|
00000070  52 94 56 b5 52 94 52 94  56 b5 52 94 52 94 56 b5  |R.V?R.R.V?R.R.V?|
00000080  52 94 52 94 56 b5 52 94  52 94 56 b5 52 94 52 94  |R.R.V?R.R.V?R.R.|
00000090  56 b5 52 94 52 94 42 10  31 8c 42 10 42 10 42 10  |V?R.R.B.1.B.B.B.|

I actually reached out to Edward de Jong and asked him about the file format. His response:

The Discus file format is fairly straightforward

The first four bytes are the signature of a Discus file

The Discus software was built for the PowerPC chip for the Macintosh, but also for Windows. The last couple of updates to Discus version 4 added Intel chip support, but then no new versions were created. Edward explained:

I still use Discus myself, but on the Macintosh it stops running after OSX 10.14 because Apple cruelly discontinued their emulator for older operating systems and the Intel instruction set.

After Toast version 7, Roxio used a different labeling software bundled with their burning software. This and many others still need some research and documentation. For now, a signature for Discus will help in weeding out these labels, which some might find unneeded but others may find invaluable.
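
To make the header variations seen in the Discus samples concrete, here is a small Python sketch that pulls apart the first 8 bytes. This is not my actual signature proposal, only an illustration. It assumes the two bytes after CD CD form a big-endian 16-bit value (0x0007, 0x2007 or 0x0008 in the headers shown) and that the next four bytes are an ASCII tag, either CANV or PREV. What the 16-bit value means is unknown to me, and the function name is made up.

```python
import struct
from typing import Optional, Tuple


def parse_discus_header(data: bytes) -> Optional[Tuple[int, str]]:
    """Return (16-bit value, tag) for a Discus-like header, else None."""
    if len(data) < 8 or data[:2] != b"\xcd\xcd":
        return None
    # Assumed big-endian, matching the other values in these Macintosh files.
    (value,) = struct.unpack(">H", data[2:4])
    tag = data[4:8]
    if tag not in (b"CANV", b"PREV"):
        return None
    return value, tag.decode("ascii")
```

With the samples above, versions 2 and 3 would return `(7, "CANV")`, the header sent to me `(0x2007, "CANV")`, and the version 4 sample `(8, "PREV")`.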

Take a look at some samples and my signature proposal and let me know what you think.

BE DEAD

April 24, 2026 by Thor

If you remember the older post about Cafe Beef, you’ll appreciate the file format we explore in this post, which uses the hex values “BE DEAD”. I guess they jinxed themselves, because the software didn’t survive a refresh in 2009 and died. At one point the software was considered remarkable, being awarded 4.5 Mice by Macworld Magazine in August 2002.

When a colleague reached out to me recently with a file they were not familiar with, I jumped in. I love a good challenge. The file had no extension, but was thought to have come from a Windows system. With a little digging I was able to identify the file as a Now Contact file. Now Contact does have both a Windows and a Macintosh version, but with no extension, my money was on the file coming from the Mac.

I started my search with the obvious, the first few bytes. Since I only had one file, I wasn’t sure if this would be helpful, but looking at the bytes, I figured it was significant.

% hexdump -C "CONTACT FILE" | head
00000000  be de ad 01 00 00 00 03  00 1d 5f a9 00 7d d1 8e  |.........._..}..|
00000010  00 00 0e d8 98 89 d7 4b  00 8f 31 fd 00 00 3b f0  |.......K..1...;.|
00000020  00 de 76 56 be de ad 00  e6 02 b4 af 63 64 62 68  |..vV........cdbh|
00000030  00 00 00 00 00 00 01 8e  90 8f 56 13 00 0d 09 b4  |..........V.....|
00000040  00 0d 0b e0 00 0d 0c 50  00 0d 0b e8 00 0d 0c 58  |.......P.......X|
00000050  00 00 00 00 ba f3 fa eb  00 00 27 12 00 00 00 00  |..........'.....|
00000060  e6 02 b4 76 00 00 00 00  00 00 00 00 00 00 00 00  |...v............|
00000070  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|

The first three bytes are “BE DE AD”, and spelling out BE DEAD seems to have been done on purpose. A quick search on the web showed no results, no mention of this unique header. I even turned to AI, asking Grok if it knew the source of this byte sequence. It had no idea. I began digging through the file looking for clues to its software source. The ASCII text I could see indicated some sort of customer database, which the file name of “CONTACT FILE” seemed to confirm. I found some dates from 2002 and started looking at popular CRM and PIM software at the time. I then found a reference to a note the user left saying they opened the file on a different Power Tower Pro. I owned one of these clones back in college, so I immediately knew they were using a Macintosh! A quick search of popular contact management software from the early 2000s revealed a few suspects. I took a look at a product from Now Software, Now Up-to-Date & Contact version 3.9, and I found the header I was looking for! Had the file sent to me retained its extended attributes from the Mac, I would have found this software much quicker.

Now Software has been around since 1990 and was purchased at one point by PowerOn Software. Now Up-to-Date came around in 1992, but Now Contact wasn’t added until 1994. Version 1.0 of Now Contact was a standalone product, popular but simple. When Now Software bundled it with Now Up-to-Date in 1995, they skipped version 2 to keep the version numbers in sync.

The Now Contact software has a few functions, including a word processor, but let’s stick to the contact manager for now. Let’s take a look at a sample file from version 1.

% hexdump -C "Sample Contact File" | head 
00000000  00 00 4c 47 00 00 00 03  a8 ee f6 c6 a8 ef 9b 95  |..LG............|
00000010  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00000100  12 bc 00 2b 73 74 61 74  00 00 00 01 00 00 00 04  |...+stat........|
00000110  00 00 00 fc 73 74 61 74  00 00 00 02 00 00 01 00  |....stat........|
00000120  00 00 06 44 4b 6e 44 42  00 00 00 00 00 00 07 44  |...DKnDB.......D|
00000130  00 00 04 68 4b 6e 44 42  00 00 00 01 00 00 0b ac  |...hKnDB........|
00000140  00 00 06 12 4b 6e 44 42  00 00 00 02 00 00 13 ac  |....KnDB........|
00000150  00 00 00 70 4b 6e 44 42  00 00 00 03 00 00 15 ac  |...pKnDB........|
00000160  00 00 00 b6 4b 6e 44 42  00 00 00 05 00 00 16 64  |....KnDB.......d|

This file does not have the “BE DE AD” header, but something else. I do see a repeated pattern of the text “KnDB” which also happens to be the Type code used on the Macintosh.

% getfileinfo "Sample Contact File"
type: "KnDB"
creator: "NIC!"
attributes: avbstClinmedz
created: 10/23/1993 13:57:10
modified: 10/24/1993 11:18:58

Another sample

% hexdump -C NC1-s02 | head
00000000  00 00 36 20 00 00 00 03  e6 03 e7 1a e6 03 e7 41  |..6 ...........A|
00000010  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00000100  0c dc 00 26 73 74 61 74  00 00 00 01 00 00 00 04  |...&stat........|
00000110  00 00 00 fc 73 74 61 74  00 00 00 02 00 00 01 00  |....stat........|
00000120  00 00 06 44 4b 6e 44 42  00 00 00 00 00 00 07 44  |...DKnDB.......D|
00000130  00 00 04 68 4b 6e 44 42  00 00 00 01 00 00 0b ac  |...hKnDB........|
00000140  00 00 00 00 4b 6e 44 42  00 00 00 02 00 00 0b ac  |....KnDB........|
00000150  00 00 00 20 4b 6e 44 42  00 00 00 03 00 00 0b cc  |... KnDB........|
00000160  00 00 00 b6 4b 6e 44 42  00 00 00 05 00 00 0c 84  |....KnDB........|

These version 1 files don’t seem to have a static header, but they do have common byte sequences. I will need to make more samples to get a proper signature constructed.
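Until more samples are available, the shared byte sequences can at least be checked mechanically. The following is a minimal sketch, assuming the offsets seen in the two samples above (“stat” at 0x104 and 0x114, “KnDB” at 0x124) hold for other version 1 files. The function name is made up for illustration and this is not a finished PRONOM signature.

```python
def looks_like_now_contact_v1(data: bytes) -> bool:
    """Heuristic check based only on the two version 1 samples shown above.

    Assumption: the 'stat' and 'KnDB' tags always sit at these offsets.
    """
    return (
        len(data) >= 0x128
        and data[0x104:0x108] == b"stat"
        and data[0x114:0x118] == b"stat"
        and data[0x124:0x128] == b"KnDB"
    )


# Build a small in-memory stand-in for a version 1 file to exercise the check.
demo = bytearray(0x170)
demo[0x104:0x108] = b"stat"
demo[0x114:0x118] = b"stat"
demo[0x124:0x128] = b"KnDB"
print(looks_like_now_contact_v1(bytes(demo)))
```

Checking three tags rather than one keeps the chance of a false match low while the real structure of the header is still unknown.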

Now Contact skipped version 2 so the next version to be released was 3.0. What do these files look like?

% hexdump -C "Sample Contact File" | head
00000000  be de ad 01 00 00 00 03  00 00 a7 62 00 00 03 fc  |...........b....|
00000010  00 00 00 19 00 01 6e 9a  00 00 36 94 00 00 00 55  |......n...6....U|
00000020  00 01 42 ea be de ad 00  b4 25 c0 65 00 01 63 6b  |..B......%.e..ck|
00000030  00 00 00 87 00 00 00 6c  fc 63 8e a8 6f 62 6a 65  |.......l.c..obje|
00000040  00 00 00 81 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000050  00 01 00 00 00 80 00 00  00 36 63 6d 6e 74 00 00  |.........6cmnt..|
00000060  00 80 00 00 00 00 00 00  00 00 00 00 00 00 00 03  |................|
00000070  63 64 61 74 00 00 00 04  b4 25 c0 65 63 74 68 74  |cdat.....%.ectht|
00000080  00 00 00 00 63 73 65 6c  00 00 00 04 00 00 00 00  |....csel........|
00000090  be de ad 00 b4 25 c0 a4  44 4c 54 7a 00 00 00 ed  |.....%..DLTz....|

They have the same header as the file I received. Let’s try and open my file in Now Contact 3.9.

Oops, that didn’t work. There must be something in my file which tells the software it is from a newer version. After some digging in the file I can see some possible version text.

% hexdump -C "CONTACT FILE"
00000000  be de ad 01 00 00 00 03  00 1d 5f a9 00 7d d1 8e  |.........._..}..|
00000010  00 00 0e d8 98 89 d7 4b  00 8f 31 fd 00 00 3b f0  |.......K..1...;.|
00000020  00 de 76 56 be de ad 00  e6 02 b4 af 63 64 62 68  |..vV........cdbh|
00000030  00 00 00 00 00 00 01 8e  90 8f 56 13 00 0d 09 b4  |..........V.....|
00000040  00 0d 0b e0 00 0d 0c 50  00 0d 0b e8 00 0d 0c 58  |.......P.......X|
00000050  00 00 00 00 ba f3 fa eb  00 00 27 12 00 00 00 00  |..........'.....|
00000060  e6 02 b4 76 00 00 00 00  00 00 00 00 00 00 00 00  |...v............|
00000070  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
000003a0  00 50 00 21 00 f1 01 cf  00 00 00 82 00 00 00 34  |.P.!...........4|
000003b0  6b 65 79 73 00 00 00 82  00 00 00 00 00 00 00 00  |keys............|
000003c0  00 01 00 00 00 01 6b 65  79 68 00 00 00 15 76 34  |......keyh....v4|
000003d0  30 30 00 00 00 01 00 00  00 00 00 00 00 00 00 00  |00..............|
000003e0  00 00 00 00 be de ad 00  ba 42 cd 51 05 00 64 62  |.........B.Q..db|
000003f0  00 00 00 02 00 00 00 18  00 00 00 00 00 00 00 00  |................|
00000400  00 be de ad 00 b9 d0 15  aa 02 00 66 6c 00 00 00  |...........fl...|
00000410  14 00 00 00 2d b1 0b 27  34 76 34 30 30 00 00 00  |....-..'4v400...|

The file has some repeated text with v400. Sure enough, the file opens in version 4 with no problems. I am able to view all the contacts and even export them as a CSV. Looking at a sample file from a version 4 install confirms the version information.
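The same check can be scripted. This is a rough sketch, assuming the version text always follows a “keyh” tag and four more bytes, as it does in the dumps above. The function names are hypothetical, and whether version 3 files carry a similar tag is not something the samples here confirm.

```python
import re

# First four bytes shared by the version 3 and version 4 samples above.
BEDEAD_MAGIC = bytes.fromhex("bedead01")

# Assumption: 'keyh', four bytes of unknown meaning, then text such as 'v400'.
VERSION_TAG = re.compile(rb"keyh.{4}(v\d{3})", re.DOTALL)


def has_bedead_header(data: bytes) -> bool:
    """True when the data starts with the BE DE AD 01 header."""
    return data.startswith(BEDEAD_MAGIC)


def find_version_tag(data: bytes):
    """Return the first version string such as 'v400', or None if absent."""
    match = VERSION_TAG.search(data)
    return match.group(1).decode("ascii") if match else None


# In-memory example built from bytes visible in the dumps above.
demo = bytes.fromhex("bedead0100000003") + bytes(16) + bytes.fromhex(
    "6b 65 79 68 00 00 00 15 76 34 30 30 00 00 00 01"
)
print(has_bedead_header(demo), find_version_tag(demo))
```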

% hexdump -C "Sample Contact File" | head
00000000  be de ad 01 00 00 00 03  00 07 5c 06 00 01 23 40  |..........\...#@|
00000010  00 00 00 10 00 17 9c 42  00 01 10 92 00 00 00 63  |.......B.......c|
00000020  00 10 37 3a be de ad 00  b7 39 82 7b 00 01 63 6b  |..7:.....9.{..ck|
00000030  00 00 00 80 00 00 5d 62  52 66 a1 a9 6f 62 6a 65  |......]bRf..obje|
00000040  00 00 00 86 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000050  00 06 00 00 00 81 00 00  00 34 6b 65 79 73 00 00  |.........4keys..|
00000060  00 81 00 00 00 00 00 00  00 00 00 00 00 00 00 01  |................|
00000070  6b 65 79 68 00 00 00 15  76 34 30 30 00 00 00 01  |keyh....v400....|
00000080  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000090  00 82 00 00 00 b4 6e 6f  74 65 00 00 00 82 00 00  |......note......|

Now Software updated the software for a few years in the early 1990’s. There were Windows versions as well, and the format is the same except for one detail.

% hexdump -C NC452-Win-s01.NCT | tail
00004ff0  00 00 00 00 00 00 00 00  00 00 00 00 00 08 00 32  |...............2|
00005000  00 32 01 90 01 90 00 00  00 08 00 4f 00 32 02 01  |.2.........O.2..|
00005010  03 11 00 00 01 00 00 00  01 18 00 00 00 18 00 00  |................|
00005020  00 32 00 00 00 00 00 00  00 00 00 1c 00 32 00 00  |.2...........2..|
00005030  64 65 52 65 00 00 00 0a  00 01 ff ff 00 00 00 0c  |deRe............|
00005040  00 00 00 00 00 00 4e fa  69 da 88 7b 00 00 50 54  |......N.i..{..PT|
00005050  77 69 6e 73                                       |wins|

It appears in version 4, the final bytes would indicate “wins” or “macs”. This continued in version 5 which came out in 2005.
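Reading that trailer takes only a few lines. This is a small sketch, assuming the platform marker is always the last four bytes of the file, as in the Windows and Macintosh samples shown here. The function name is made up for illustration.

```python
def platform_trailer(data: bytes):
    """Map the last four bytes to a platform name, or None if unrecognized."""
    platforms = {b"wins": "Windows", b"macs": "Macintosh"}
    return platforms.get(data[-4:])


# Final bytes of the Windows sample (offset 0x5050 in the dump above).
print(platform_trailer(bytes.fromhex("00 00 50 54 77 69 6e 73")))
```

Files from version 3 and earlier would simply return None here, since the trailer has only been seen from version 4 onward.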

% hexdump -C NC501-s01.nct | head
00000000  be de ad 01 00 00 00 03  00 00 02 cf 00 00 1e 14  |................|
00000010  00 00 00 11 00 00 5c 93  00 00 43 7e 00 00 00 50  |......\...C~...P|
00000020  00 00 59 da 00 00 00 34  00 00 00 2c 00 00 05 5e  |..Y....4...,...^|
00000030  00 00 02 cc be de ad 00  e6 02 e9 89 01 00 64 62  |..............db|
00000040  00 00 00 02 00 00 00 18  00 00 00 00 be de ad 00  |................|
00000050  e6 02 e9 91 63 64 62 68  00 00 00 00 00 00 01 8e  |....cdbh........|
00000060  be 63 10 4d 00 7f 63 c8  00 7f 65 64 00 7f 66 d0  |.c.M..c...ed..f.|
00000070  00 7f 66 bc 00 00 00 00  00 01 00 00 e6 02 e9 8a  |..f.............|
00000080  00 00 27 11 00 00 00 00  e6 02 e9 91 00 00 00 00  |..'.............|
00000090  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00005ad0  00 00 00 00 00 00 00 00  00 00 00 00 01 00 00 00  |................|
00005ae0  01 00 00 00 00 00 00 00  00 1e 00 00 00 00 00 00  |................|
00005af0  00 00 00 1c 00 1e ff ff  00 00 00 00 00 00 00 00  |................|
00005b00  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 08  |................|
00005b10  00 32 00 32 02 4f 03 11  00 00 59 da 57 7e a0 f3  |.2.2.O....Y.W~..|
00005b20  00 00 5b 28 6d 61 63 73                           |..[(macs|

Also in the version 5 samples, we still see the v400 text, so it appears the format was not changed.

% hexdump -C /Volumes/File\ Formats/Now/NC531-s01.nct       
00000000  be de ad 01 00 00 00 03  00 00 04 90 00 00 1e 14  |................|
00000010  00 00 00 11 00 00 54 83  00 00 43 7e 00 00 00 50  |......T...C~...P|
00000020  00 00 53 5e 00 00 00 34  00 00 00 2c 00 00 05 5e  |..S^...4...,...^|
00000030  00 00 02 cc be de ad 00  e6 04 44 0e 01 00 64 62  |..........D...db|
00000040  00 00 00 02 00 00 00 18  00 00 00 00 be de ad 00  |................|
00000050  e6 04 44 5c 63 64 62 68  00 00 00 00 00 00 01 8e  |..D\cdbh........|
00000060  41 f7 ad 20 cc 8e b5 01  74 8f b5 01 e8 00 b6 02  |A.. ....t.......|
00000070  e4 00 b6 02 00 00 00 00  01 00 00 00 0e 44 04 e6  |.............D..|
00000080  11 27 00 00 00 00 00 00  5c 44 04 e6 00 00 00 00  |.'......\D......|
00000090  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
000001e0  00 00 00 00 00 00 00 00  00 00 00 00 00 00 be de  |................|
000001f0  ad 00 e6 04 44 0e 02 00  66 6c 00 00 00 03 00 00  |....D...fl......|
00000200  00 2d b1 0b 27 34 76 34  30 30 00 00 00 01 00 00  |.-..'4v400......|
00000210  00 00 00 00 00 00 00 00  00 00 00 be de ad 00 e6  |................|
00000220  04 44 0e 02 00 66 6c 00  00 00 05 00 00 00 2d b1  |.D...fl.......-.|
00000230  0b 27 34 76 34 30 30 00  00 00 01 00 00 00 00 00  |.'4v400.........|

Now Up-to-Date & Contact released version 5.3 around 2008 which finally provided support for Intel processors. It was the last version released before Now Software attempted a full re-write of the software in 2009 named Now X (code-named “NightHawk”). The software did not receive good reviews and by 2010 the company ceased operations. So far I have come up empty in getting a copy of this doomed version, but I will update this post if I am able to get my hands on a copy.

For now, you can take a look at some sample files on GitHub. I will also add some PRONOM signatures for them soon.

TurboTax

April 3, 2026 by Thor Leave a comment

With all the different file formats that are found in everyday computing, most formats which find their way to my archive have historical value. We know we can’t keep everything and have to assign value to everything we decide to keep for the long term. Some files have sensitive data and we have to follow guidelines for their proper handling. Identification of files helps us know what type of data might be kept inside the format, so often I need to also identify formats we don’t plan on keeping.

I was recently looking through a large digital collection and a report on the files which did not identify in the initial scan. A few popped out to me because of their extension, TAX. Tax records are one thing we need to identify so we can properly handle them, but not likely keep in our repository.

These tax files come from the popular US-based TurboTax software. The software gets a new version every year as tax laws are constantly changing. The software has also been around since 1984, so there are many versions to be aware of. Add the fact that there are personal and business versions along with DOS, Windows, and Macintosh versions, and identification might get complicated. None of these formats are documented in the PRONOM registry. Wikidata is aware of a couple of the extensions, but does not have any signatures to help in identification.

Luckily, the collection of files I was processing had a number of years’ worth of records. Using them and a few others I was able to put together a decent timeline of formats used, at least from the early 1990’s on. The format seemed to settle on the .TAX extension around the 1994 Windows version. Before this, the DOS versions stored the data across a group of files. Let’s look at a sample of the 1994 file from Windows.

% hexdump -C TT1994.TAX | head
00000000  54 75 72 62 6f 54 61 78  0d 0a 46 6f 72 6d 61 74  |TurboTax..Format|
00000010  3d 57 49 4e 0d 0a 56 65  72 73 69 6f 6e 3d 31 33  |=WIN..Version=13|
00000020  0d 0a 45 6e 67 69 6e 65  56 65 72 73 53 74 72 3d  |..EngineVersStr=|
00000030  36 2e 30 30 2e 31 0d 0a  46 6f 72 6d 73 65 74 3d  |6.00.1..Formset=|
00000040  53 31 39 39 34 55 53 31  30 34 30 0d 0a 43 65 6e  |S1994US1040..Cen|
00000050  74 73 3d 59 65 73 0d 0a  53 68 6f 77 43 6f 6d 6d  |ts=Yes..ShowComm|
00000060  61 73 3d 59 65 73 0d 0a  53 68 6f 77 43 6f 6c 6c  |as=Yes..ShowColl|
00000070  61 70 73 69 62 6c 65 57  6f 72 6b 53 68 65 65 74  |apsibleWorkSheet|
00000080  73 3d 59 65 73 0d 0a 44  61 74 61 56 65 72 73 69  |s=Yes..DataVersi|
00000090  6f 6e 3d 31 0d 0a 46 6f  72 6d 46 69 6c 65 53 75  |on=1..FormFileSu|

I love these easy to identify format headers, but then jump to the next year, 1995, and the format changes.

% hexdump -C TT1995.TAX | head
00000000  c0 45 01 5f 0a 00 00 35  b5 06 36 2e 30 30 2e 31  |.E._...5..6.00.1|
00000010  00 00 c7 00 02 00 02 0d  00 00 00 b4 00 00 00 d9  |................|
00000020  00 0e 53 31 39 39 35 55  53 31 30 34 30 50 45 52  |..S1995US1040PER|
00000030  01 01 01 00 00 00 01 00  01 00 00 35 b5 00 0a c8  |...........5....|
00000040  00 01 00 01 09 00 00 00  cf 00 06 00 06 1d 00 00  |................|
00000050  00 3e 00 00 00 3e 00 00  00 64 00 00 00 64 00 00  |.>...>...d...d..|
00000060  00 7e 00 00 00 ce 13 7a  65 7a 50 65 72 73 69 73  |.~.....zezPersis|
00000070  74 65 6e 74 53 74 61 74  75 73 00 65 00 64 00 01  |tentStatus.e.d..|
00000080  00 00 00 00 00 00 ce 12  7a 74 6c 50 65 72 73 69  |........ztlPersi|
00000090  73 74 46 69 6c 65 44 61  74 61 00 00 00 00 00 00  |stFileData......|

The nice easy-to-read header is gone, but some other patterns start to appear. It seems most of the files from these early versions also used a code near the beginning that may help. “S1995US1040PER” is similar to the “S1994US1040” in the 1994 file. One could assume the “1040” is the tax form most Americans are used to, with “US” preceding the number. Then at the end of the string we see “PER”. This may refer to different editions of the tax software: a Personal edition for the individual, and possibly other editions for business. I believe TurboTax also had versions for Canadians, so there may be many variations on this string. This could get complex. Let’s jump ahead to a 1999 file.
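To compare this code across many files, it can be pulled out with a regular expression. The sketch below assumes the layout guessed at above: “S”, a four-digit year, a two-letter country, a form number, and an optional suffix such as “PER”. The field names and the function name are made up for illustration, and other editions may not follow this pattern.

```python
import re

# Assumed layout: S + year + country + form number + optional edition suffix.
FORMSET = re.compile(rb"S(\d{4})([A-Z]{2})(\d+)([A-Z]*)")


def parse_formset(data: bytes):
    """Return the first formset-like code found in the data, or None."""
    match = FORMSET.search(data)
    if match is None:
        return None
    year, country, form, edition = (g.decode("ascii") for g in match.groups())
    return {"year": int(year), "country": country, "form": form, "edition": edition}


# Bytes taken from offset 0x20 of the 1995 sample above.
demo = bytes.fromhex("00 0e 53 31 39 39 35 55 53 31 30 34 30 50 45 52 01 01")
print(parse_formset(demo))
```

Running this over a whole collection would quickly show which year, country, and edition combinations actually occur.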

% hexdump -C TurboTax1999.tax | head 
00000000  c0 45 01 5f 0a 00 00 54  6a 16 4c 39 31 30 32 31  |.E._...Tj.L91021|

00000030  00 0e 53 31 39 39 39 55  53 31 30 34 30 50 45 52  |..S1999US1040PER|
00000040  00 00 01 00 00 00 25 00  00 00 00 00 00 00 00 00  |......%.........|
00000050  01 19 12 8f f1 00 0a 00  00 00 00 00 00 00 00 00  |................|
00000060  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000070  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 c8  |................|
00000080  00 04 00 04 15 00 00 00  ec 05 00 00 c3 07 00 00  |................|
00000090  a3 08 00 00 c4 00 05 46  31 30 34 30 00 00 00 01  |.......F1040....|

The same string is visible, but of course with the year “1999”. We can also see that the first 4 bytes, “c0 45 01 5f”, are consistent with the 1995 file. The file I have for 1998 is consistent as well. Jumping to the new millennium, we see a change.

% hexdump -C TurboTax2000.tax | head
00000000  c0 45 01 64 0a 00 00 2e  4f 18 4c 30 30 39 32 37  |.E.d....O.L00927|

00000030  00 dc 00 0b 53 32 30 30  30 55 53 31 31 32 30 00  |....S2000US1120.|
00000040  00 01 00 00 00 09 00 00  00 00 00 00 00 00 00 01  |................|

We see two changes with this file. First, the ASCII string is different: S2000US1120, with 1120 being the U.S. Corporation Income Tax Return, so this file came from a different edition of the software. Second, the first 4 bytes changed to “c0 45 01 64”, with the last byte going from 5F to 64. Jumping to 2003, we see the same values.

% hexdump -C TurboTax2003.tax | head 
00000000  c0 45 01 64 0d 00 00 80  1b 26 54 59 30 33 5f 4c  |.E.d.....&TY03_L|

00000040  58 03 00 dc 00 0e 53 32  30 30 33 55 53 31 30 34  |X.....S2003US104|
00000050  30 50 45 52 00 00 01 00  00 6a c6 00 00 00 00 00  |0PER.....j......|

Back to a 1040 form, but with the same header as the 2000 file. I am removing some lines, just to be safe and to avoid exposing any personal data. In 2004 we see a major change in the format.

% hexdump -C TurboTax2004.tax | head 
00000000  54 54 46 4e 01 01 6f 68  dc 62 00 00 00 00 4b 01  |TTFN..oh.b....K.|

Again, I am removing some lines to ensure safety. This header is very different and there is no human-readable ASCII in the file, which means it is binary and probably encoded. This header is new. I assume TTFN references a TurboTax format or file, or possibly “Turbo Tax Financial Network”?

This header is then used for the next few years ending in 2013, but before we get there, the extension makes a change as well. In 2008, instead of the simple .TAX extension, the software begins to save the tax file with the extension .TAX2008. I don’t have a 2008 document, but I do have a sample 2009 document.

% hexdump -C TurboTax2009.tax2009 | head
00000000  54 54 46 4e 01 01 b5 68  02 24 00 00 00 00 4b 0b  |TTFN...h.$....K.|
00000010  01 01 19 13 01 01 01 52  01 01 01 0b 01 01 4e 7a  |.......R......Nz|

The last year to use the TTFN header was 2013.

% hexdump -C TurboTax2013.tax2013 | head
00000000  54 54 46 4e 01 01 87 22  6a ec 00 00 00 00 50 bd  |TTFN..."j.....P.|

2014 is where I get a little confused. I have one file which uses the TTFN header and another which uses what becomes the standard going forward. But by 2015, the format is definitely using a ZIP container as its structure. Here is a sample from 2015.

% hexdump -C TurboTax2015.tax2015 | head
00000000  50 4b 03 04 2d 00 02 00  08 00 e5 a6 51 48 ba 4d  |PK..-.......QH.M|
00000010  43 67 15 06 00 00 10 06  00 00 0c 00 14 00 6d 61  |Cg............ma|
00000020  6e 69 66 65 73 74 2e 78  6d 6c 01 00 10 00 00 00  |nifest.xml......|
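The name of the first entry can be read straight from these bytes. This sketch follows the fixed 30-byte local file header layout from the ZIP specification and uses only the bytes shown in the dump above. The function name is made up for illustration, and nothing here is specific to TurboTax.

```python
import struct


def first_zip_entry_name(data: bytes):
    """Return the file name stored in the first ZIP local file header, or None."""
    if len(data) < 30 or data[:4] != b"PK\x03\x04":
        return None
    # Signature, version, flags, method, time, date, CRC-32,
    # compressed size, uncompressed size, name length, extra field length.
    fields = struct.unpack("<4s5H3I2H", data[:30])
    name_length = fields[9]
    return data[30:30 + name_length].decode("cp437")


# The first 48 bytes of the 2015 sample above.
demo = bytes.fromhex(
    "50 4b 03 04 2d 00 02 00 08 00 e5 a6 51 48 ba 4d"
    "43 67 15 06 00 00 10 06 00 00 0c 00 14 00 6d 61"
    "6e 69 66 65 73 74 2e 78 6d 6c 01 00 10 00 00 00"
)
print(first_zip_entry_name(demo))
```

Reading the header directly avoids relying on an archive library, which may or may not tolerate whatever is unusual about these containers.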

Let’s take a look inside the ZIP container of a 2017 dummy sample.

% 7z l TurboTax2017.tax2017
7-Zip [64] 17.05 : Copyright (c) 1999-2021 Igor Pavlov : 2017-08-28
p7zip Version 17.05 (locale=utf8,Utf16=on,HugeFiles=on,64 bits,8 CPUs LE)

Scanning the drive for archives:
1 file, 769814 bytes (752 KiB)                                              

Listing archive: TurboTax2017.tax2017

--
Path = TurboTax2017.tax2017
Type = zip
WARNINGS:
Headers Error
Physical Size = 769814

   Date      Time    Attr         Size   Compressed  Name
------------------- ----- ------------ ------------  ------------------------
2026-03-28 20:25:38 .....          576          581  manifest.xml
2026-03-28 20:25:38 .....       768688       768923  084A702A-CD3D-4623-B8B7-EE4800BB151F
------------------- ----- ------------ ------------  ------------------------
2026-03-28 20:25:38             769264       769504  2 files

Warnings: 1

The files all seem to have a manifest.xml and a unique identifier. 7-Zip also reports a header error with the ZIP files. Is this maybe done on purpose? Now comes the odd part: the manifest.xml file does not render as an XML file, it is binary.

% hexdump -C TurboTax2017/manifest.xml | head
00000000  a1 b1 fe fb 37 18 dd 9c  08 2d 9c 86 23 00 10 fa  |....7....-..#...|
00000010  12 60 92 bb dc 92 a5 df  1a 24 16 4e a9 28 89 80  |.`.......$.N.(..|
00000020  64 33 66 55 c5 93 f0 68  44 d0 7c f9 56 86 42 2c  |d3fU...hD.|.V.B,|
00000030  80 ba 8a 95 2a 82 6d 32  75 84 b1 f1 e2 18 93 5c  |....*.m2u......\|
00000040  82 4d 18 f9 ed 23 4f dc  d6 b5 7f f2 20 1e 30 59  |.M...#O..... .0Y|
00000050  d5 7f 47 7d aa f5 8d bd  8b 10 20 ec 8a c7 43 df  |..G}...... ...C.|
00000060  52 90 a9 70 4d 68 b4 76  fa c8 37 85 f5 56 25 82  |R..pMh.v..7..V%.|
00000070  ea 16 06 54 b0 b4 bc 43  16 fb 70 7b 7a 79 a5 8b  |...T...C..p{zy..|
00000080  3c 79 7d ef ac 32 fc 35  ce 0f fa a2 6f e7 c3 a4  |<y}..2.5....o...|
00000090  92 a1 a4 c8 83 dd 9f 32  f4 ea d3 1a eb 89 15 a3  |.......2........|

All of the samples I have that contain a manifest.xml begin with “a1 b1 fe fb”, which apparently is the header for an AES CBC encrypted file. A clever person was able to decrypt the file to reveal the actual XML.

TurboTax isn’t sold on physical disk anymore, but you can download the current tax year version from their website. I am not a user of their product so I am not sure if the latest version still saves files in the same way. If you do use it currently, I would love to know if it is still the same.

So to recap, the headers are:

1994 “TurboTax Format=WIN Version=13”
1995-99 “C045015F”
2000-03 “C0450164”
2004-13 “TTFN”
2014-current “ZIP Container”

This should be enough to create five new signatures for identification. Extensions will be a problem since they change every year, but we can add them to the list. With these signatures we can now identify all the tax files we have and set them aside if not needed.
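As a stopgap until real signatures exist, the recap above can be turned into a simple lookup. This is only a sketch based on the handful of samples in this post. The function name and labels are made up for illustration, and the ZIP check would match any ZIP file, so a real signature would need something more specific.

```python
def turbotax_header_family(data: bytes) -> str:
    """Classify a file by the leading bytes listed in the recap above."""
    if data.startswith(b"TurboTax\r\n"):
        return "1994 text header"
    if data.startswith(bytes.fromhex("c045015f")):
        return "1995-99 binary"
    if data.startswith(bytes.fromhex("c0450164")):
        return "2000-03 binary"
    if data.startswith(b"TTFN"):
        return "2004-13 TTFN"
    if data.startswith(b"PK\x03\x04"):
        # Assumption: any ZIP matches here, not just TurboTax containers.
        return "ZIP container"
    return "unknown"


# First bytes of the 2009 sample above.
print(turbotax_header_family(bytes.fromhex("54 54 46 4e 01 01 b5 68")))
```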

iView

February 13, 2026 by Thor Leave a comment

It seems to be a common theme through the history of software that some titles get bought, sold, rebranded, integrated, and discontinued by a number of companies. I find it interesting to find out a popular software title’s humble beginnings. Often when a piece of software gets bought, the file formats don’t change much, at least at first.

A little shareware program called iView was started by a company called Script Software in 1996. They later changed their name to Plum Amazing. iView then became iView Multimedia, then an iView MediaPro version before it was bought by Microsoft where they changed the name to Expression Media. After a couple years the software was bought by Phase One and then discontinued. Let’s take a look at the history.

iView, according to their website in 1997, is simply the easiest and fastest way to view and catalog pictures for the Mac. The software initially only worked on the Macintosh and the Catalog file it produced did not have an extension. But they did have a Type/Creator code. A catalog produced by version 2 of the iView software was IVWc/IVW2.

% hexdump -C iView2-s01 | head
00000000  00 00 00 05 30 32 35 69  47 4f 53 58 3a 4c 69 62  |....025iGOSX:Lib|
00000010  72 61 72 79 3a 41 70 70  6c 69 63 61 74 69 6f 6e  |rary:Application|
00000020  20 53 75 70 70 6f 72 74  3a 41 70 70 6c 65 3a 69  | Support:Apple:i|
00000030  43 68 61 74 20 49 63 6f  6e 73 3a 46 72 75 69 74  |Chat Icons:Fruit|
00000040  3a 47 72 65 65 6e 20 41  70 70 6c 65 2e 67 69 66  |:Green Apple.gif|
00000050  03 46 44 63 00 00 0f ef  03 46 44 63 08 93 65 58  |.FDc.....FDc..eX|
00000060  00 01 5c 50 00 01 5a c8  68 ff f7 40 08 93 65 4b  |..\P..Z.h..@..eK|
00000070  08 13 9a c0 ff d1 3a 80  00 a3 c8 a0 00 00 28 00  |......:.......(.|
00000080  00 05 48 64 00 00 a0 24  00 00 39 ec 00 00 00 0a  |..Hd...$..9.....|
00000090  08 93 65 64 44 00 00 24  3d 14 51 84 3d 9d 74 bc  |..edD..$=.Q.=.t.|

The iView format is a proprietary binary format used to store a catalog of multimedia formats with their metadata and thumbnail. The media viewer had support for quite a few popular formats. The file seems to have paths to each of the files it has cataloged, so some of these iView files can get pretty large.

In 2003 the iView software was ported to Windows, which brought a formal extension to the catalog format. This was also when iView made the switch from the classic MacOS to MacOSX, where extensions were encouraged. iView came in two different editions, a standard shareware version and a Media Pro version, and each had its own version numbers. iView MediaPro was not compatible with Macintosh 68K machines or systems earlier than 8.6. The last Media Pro version was version 3.8.6. You can get most of the old software versions here.

% hexdump -C iViewPro302-s01.ivc | head
00000000  00 00 00 00 30 32 35 69  46 53 4d 21 00 00 00 2e  |....025iFSM!....|
00000010  66 6c 64 72 00 00 00 2e  00 00 00 00 00 00 00 06  |fldr............|
00000020  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000030  00 00 00 00 00 00 00 00  3c 72 6f 6f 74 3e 42 4c  |........<root>BL|
00000040  44 4f 00 00 00 0c 31 00  02 00 00 00 01 01 00 00  |DO....1.........|
00000050  00 00 55 53 46 33 00 00  00 02 01 03 43 4d 52 53  |..USF3......CMRS|
00000060  00 00 01 ed 01 00 00 02  0a 01 00 00 00 00 00 00  |................|
00000070  00 02 f2 01 00 00 00 00  00 00 00 00 a2 01 00 00  |................|
00000080  00 00 02 01 03 00 00 00  a1 01 00 00 00 00 00 00  |................|
00000090  00 00 48 00 00 00 00 00  00 00 00 00 03 01 00 00  |..H.............|

This time there is an extension, IVC, but with a familiar pattern at the beginning: the string 025i, hex values “30323569”, at byte offset 4. The iView files from previous versions have the same bytes, but only Media Pro version 2 & 3 files match an existing PRONOM identification.

% sf iViewPro302-s01.ivc 
filename : 'iViewPro302-s01.ivc'
filesize : 3757
modified : 2025-09-17T17:39:27-06:00
errors   : 
matches  :
  - ns      : 'pronom'
    id      : 'fmt/647'
    format  : 'Microsoft Expression Media'
    version : '2'
    mime    : 
    class   : 'Presentation'
    basis   : 'extension match ivc; byte match at [[4 4] [3737 16]]'

These are iView Media Pro files, why are they identifying as Microsoft Expression Media files? That is because Microsoft bought iView Media Pro on June 27, 2006. Microsoft rebranded the software as Expression Media, not to be confused with Expression Studio. It was available for Windows and Macintosh, but not everyone was happy with the purchase. Version 1 of Expression Media was released the next year and was a free upgrade for iView Media Pro users. The format doesn’t appear to have changed much at all. In fact, an iView Media Pro 3 file with no content and an Expression Media 1 file are practically identical.

% hexdump -C Expression1-s01.ivc | head
00000000  00 00 00 00 30 32 35 69  46 53 4d 21 00 00 00 2e  |....025iFSM!....|
00000010  66 6c 64 72 00 00 00 2e  00 00 00 00 00 00 00 06  |fldr............|
00000020  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000030  00 00 00 00 00 00 00 00  3c 72 6f 6f 74 3e 42 4c  |........<root>BL|
00000040  44 4f 00 00 00 0c 31 00  02 00 00 00 01 01 00 00  |DO....1.........|
00000050  00 00 55 53 46 33 00 00  00 02 01 03 43 4d 52 53  |..USF3......CMRS|
00000060  00 00 01 ed 01 00 00 02  0a 01 00 00 00 00 00 00  |................|
00000070  00 02 f2 01 00 00 00 00  00 00 00 00 a2 01 00 00  |................|
00000080  00 00 02 01 03 00 00 00  a1 01 00 00 00 00 00 00  |................|
00000090  00 00 48 00 00 00 00 00  00 00 00 00 03 01 00 00  |..H.............|

The next year brought a version 2 of Expression Media, often found bundled with a Special Edition of Office 2008 for Mac, but also a standalone product for Windows. But the catalog format remained the same.

% hexdump -C Expression2-s01.ivc | head       
00000000  00 00 00 04 30 32 35 69  3a 43 3a 5c 44 4f 43 55  |....025i:C:\DOCU|
00000010  4d 45 7e 31 5c 41 4c 4c  55 53 45 7e 31 5c 44 4f  |ME~1\ALLUSE~1\DO|
00000020  43 55 4d 45 7e 31 5c 4d  59 50 49 43 54 7e 31 5c  |CUME~1\MYPICT~1\|
00000030  53 41 4d 50 4c 45 7e 31  5c 57 69 6e 74 65 72 2e  |SAMPLE~1\Winter.|
00000040  6a 70 67 00 00 00 00 00  00 00 00 00 00 00 00 00  |jpg.............|
00000050  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|

Even though all of these versions have the same 4 bytes at the beginning, not all of them match the current PRONOM signature. fmt/647 is specifically for Expression Media version 2 files, but also identifies iView Media Pro 2 & 3 and Expression Media 1 files. It doesn’t identify earlier files because the signature is also looking for some bytes near the end of the file.

% hexdump -C iViewPro302-s01.ivc | tail       

00000e90  00 00 00 00 00 00 00 00  00 53 56 61 72 00 00 00  |.........SVar...|
00000ea0  04 00 00 01 f4 30 32 35  69 00 00 00 08           |.....025i....|

The same 4 bytes appear near the end of the file as well. There is also a string used in the signature at the end, “SVar”. I am not sure what the string is used for, but it is not in earlier versions.

% hexdump -C iView157-01 | tail 

00000420  00 00 00 00 00 00 00 00  00 00 00 00 30 32 35 69  |............025i|
00000430  00 00 00 08                                       |....|

And the even earlier versions are missing the “025i” at the end.
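The three tails shown above can be told apart with a short check. This is a sketch only, assuming the positions seen in these single samples (“025i” four bytes before the end, “SVar” twenty bytes before the end) are fixed. The function name and labels are made up, and this is not how the PRONOM signature itself is written.

```python
def iview_tail_layout(data: bytes) -> str:
    """Describe which of the trailing markers seen above are present."""
    has_marker = data[-8:-4] == b"025i"
    has_svar = data[-20:-16] == b"SVar"
    if has_marker and has_svar:
        return "SVar + 025i"  # as in the iViewPro302-s01.ivc sample
    if has_marker:
        return "025i only"  # as in the iView157-01 sample
    return "no trailing marker"  # as in the iView2-s01 sample


# Last bytes of the iViewPro302-s01.ivc sample above.
demo = bytes.fromhex(
    "00 00 00 00 00 00 00 00 00 53 56 61 72 00 00 00"
    "04 00 00 01 f4 30 32 35 69 00 00 00 08"
)
print(iview_tail_layout(demo))
```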

% hexdump -C iView2-s01 | tail

000062b0  2a ae ed d4 1a eb d4 04  c4 88 76 88 c4 d6 d4 04  |*.........v.....|
000062c0  c4 79 69 79 c4 d6 d4 04  c4 78 67 78 c4 ec d4 04  |.yiy.....xgx....|
000062d0  81 d4 f1 d4 00 ff                                 |......|

Microsoft Expression Media was short lived. Microsoft decided to sell off the software to Phase One in 2010. Phase One is the developer of Capture One, a professional photo editing program. It makes sense they would want a cataloging tool to go with their flagship product. Phase One retained the name Media Pro from the original iView Media Pro software.

Phase One took the software and did make modifications, starting with the extension used to store the catalogs. They also decided to adjust the format slightly, changing the “025i” bytes to “030i”.

% hexdump -C PhaseOneMediaProv1.mpcatalog | head 
00000000  00 00 00 05 30 33 30 69  4a 4d 61 63 31 30 37 3a  |....030iJMac107:|
00000010  4c 69 62 72 61 72 79 3a  41 70 70 6c 69 63 61 74  |Library:Applicat|
00000020  69 6f 6e 20 53 75 70 70  6f 72 74 3a 41 70 70 6c  |ion Support:Appl|
00000030  65 3a 69 43 68 61 74 20  49 63 6f 6e 73 3a 46 72  |e:iChat Icons:Fr|
00000040  75 69 74 3a 47 72 65 65  6e 20 41 70 70 6c 65 2e  |uit:Green Apple.|
00000050  67 69 66 00 00 00 00 00  00 00 00 00 00 00 00 00  |gif.............|
00000060  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|

The Phase One Media Pro software uses the extension MPCATALOG, but can also open the older IVC catalogs.
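Since the only visible difference at the start of the file is the four bytes at offset 4, the two catalog families can be separated with a lookup. This is a minimal sketch based on the samples above. The function name and the labels are made up for illustration.

```python
# Markers seen at byte offset 4 in the samples above.
CATALOG_MARKERS = {
    b"025i": "iView / Expression Media catalog",
    b"030i": "Phase One Media Pro catalog",
}


def catalog_family(data: bytes):
    """Return a label for the marker at offset 4, or None if unrecognized."""
    return CATALOG_MARKERS.get(data[4:8])


# First eight bytes of the PhaseOneMediaProv1.mpcatalog sample above.
print(catalog_family(bytes.fromhex("00 00 00 05 30 33 30 69")))
```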

% sf PhaseOneMediaProv1.mpcatalog 

filename : 'PhaseOneMediaProv1.mpcatalog'
filesize : 21353
modified : 2025-09-16T20:37:07-06:00
errors   : 
matches  :
  - ns      : 'pronom'
    id      : 'fmt/648'
    format  : 'Media View Pro'
    version : 
    mime    : 
    class   : 'Presentation'
    basis   : 'extension match mpcatalog; byte match at [[4 4] [21329 16]]'

MPCATALOG files are identified in PRONOM using a signature similar to the one used for the IVC files, although the name of the format isn’t quite right. MediaPro is probably a better name.

So it seems the identification is already available in PRONOM for the later MediaPro files, both iView MediaPro and Expression Media, and a second identification for the PhaseOne catalog. So we will need to either adjust the identification to include the earlier iView versions and adjust the names or we can create a new signature for the older versions. It would be good to find out what version added the change to the format, but with all the different software versions, it might be hard to nail down.

Enjoy some samples.

December 5, 2025 by Thor Leave a comment

The main subject of these posts is obsolete software and file formats. I prefer to focus on older software titles and collect them when I can. I have also found older Macintosh software to be particularly interesting, as many of the qualities of early Macintosh use are lost today. In researching a very early Macintosh title, I came across a 1999 article from the Washington Post. The article, now 26 years old, was already commenting on “antique” software which was less than 20 years old at the time. Is there a term for even more antique? The title of the article? “Old Enthusiasts Are Scouring the Web to Find ‘Antique’ Software”. I feel this hasn’t changed. I still scour the web to find old software, and if the enthusiasts were “old” 26 years ago, then I am ancient.

Back in 1983, a little company called Living Videotext run by Dave Winer, who had developed a couple products for the Apple II, saw an opportunity to bring their product to the Macintosh. Their product, ThinkTank, was the fourth title to ship for the new Macintosh released in 1984.

ThinkTank was an “idea processor”, not a word processor, but “a tool for organizing your thoughts on a computer screen. You could create an outline, then indent, move an item up a list, or out a level. Flesh out the details, and quickly record a top-level idea you had overlooked.” It was the beginning of outliner tools created by the company.

% hexdump -C Sample | head
00000000  2e 48 45 41 44 20 30 20  2b 20 20 4d 61 6a 6f 72  |.HEAD 0 +  Major|
00000010  20 4c 65 61 67 75 65 20  42 61 73 65 62 61 6c 6c  | League Baseball|
00000020  20 54 65 61 6d 73 0d 2e  48 45 41 44 20 31 20 2b  | Teams..HEAD 1 +|
00000030  20 20 4c 65 61 67 75 65  73 20 61 6e 64 20 44 69  |  Leagues and Di|
00000040  76 69 73 69 6f 6e 73 0d  2e 48 45 41 44 20 32 20  |visions..HEAD 2 |
00000050  2b 20 20 41 6d 65 72 69  63 61 6e 20 4c 65 61 67  |+  American Leag|
00000060  75 65 0d 2e 48 45 41 44  20 33 20 2b 20 20 57 65  |ue..HEAD 3 +  We|
00000070  73 74 65 72 6e 20 44 69  76 69 73 69 6f 6e 0d 2e  |stern Division..|
00000080  48 45 41 44 20 34 20 2d  20 20 43 61 6c 69 66 6f  |HEAD 4 -  Califo|
00000090  72 6e 69 61 20 41 6e 67  65 6c 73 0d 2e 48 45 41  |rnia Angels..HEA|

The files created by ThinkTank are plain text, with each headline starting with the ASCII string “.HEAD”. There was also a DOS version of ThinkTank, which saved its files as .DB and .SAV, although its templates in .TXT files did use this same format.

% hexdump -C SAMPLE.TXT | head
00000000  2e 48 45 41 44 20 30 20  2b 20 20 50 65 72 66 6f  |.HEAD 0 +  Perfo|
00000010  72 6d 61 6e 63 65 20 52  65 76 69 65 77 0d 0a 2e  |rmance Review...|
00000020  48 45 41 44 20 31 20 2d  20 20 4e 61 6d 65 3a 20  |HEAD 1 -  Name: |
00000030  0d 0a 2e 48 45 41 44 20  31 20 2d 20 20 4a 6f 62  |...HEAD 1 -  Job|
00000040  20 54 69 74 6c 65 3a 20  0d 0a 2e 48 45 41 44 20  | Title: ...HEAD |
00000050  31 20 2d 20 20 52 65 76  69 65 77 20 44 61 74 65  |1 -  Review Date|
00000060  3a 20 0d 0a 2e 48 45 41  44 20 31 20 2d 20 20 52  |: ...HEAD 1 -  R|
00000070  65 76 69 65 77 20 70 65  72 69 6f 64 20 66 6f 72  |eview period for|
00000080  3a 20 0d 0a 2e 48 45 41  44 20 31 20 2b 20 20 4f  |: ...HEAD 1 +  O|
00000090  62 6a 65 63 74 69 76 65  73 20 4d 65 74 2f 4e 6f  |bjectives Met/No|

Turns out this was a special format they called “dot-head“, aptly named for the head of the file. It was used as an interchange format to move outlines between ThinkTank, another program called Ready!, and the later product MORE.
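Going only by the hex dumps in this post, each dot-head line appears to hold a keyword, a level number, a "+" or "-" flag, and the headline text. Here is a minimal Python sketch of how such a file might be read. The pattern is inferred from the samples rather than from any published specification, the meaning of the flag is not confirmed, and the function name `parse_dot_head` is just an illustration. It matches the keyword case-insensitively, since some files write it in lowercase.

```python
import re

# Pattern inferred from the sample dumps, not from a published spec:
# ".HEAD", a level number, a "+" or "-" flag, up to two spaces, then text.
DOT_HEAD = re.compile(rb"^\.head (\d+) ([+-]) {0,2}(.*)$", re.IGNORECASE)


def parse_dot_head(data: bytes):
    """Return (level, flag, text) tuples for every dot-head line found."""
    entries = []
    # Macintosh samples end lines with CR, the DOS template with CR LF.
    for line in re.split(rb"\r\n|\r|\n", data):
        match = DOT_HEAD.match(line)
        if match:
            level, flag, text = match.groups()
            entries.append(
                (int(level), flag.decode("ascii"), text.decode("ascii", "replace"))
            )
    return entries
```

Lines that do not start with the keyword are skipped, so a file with body text between headlines would still yield its outline structure.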

MORE was developed to be multiple tools in one, meant to “Unite idea processing technology with the desktop publishing revolution”. MORE replaced ThinkTank in 1986 and promised more flexibility by creating charts and presentations quickly from your outline. MORE initially used the same dot-head format, although the ASCII keyword could be in lowercase.

% hexdump -C MORE1 | head
00000000  2e 68 65 61 64 20 30 20  2b 20 20 48 6f 6d 65 0d  |.head 0 +  Home.|
00000010  2e 68 65 61 64 20 31 20  2d 20 20 0d 2e 68 65 61  |.head 1 -  ..hea|
00000020  64 20 31 20 2d 20 20 54  65 73 74 69 6e 67 0d 2e  |d 1 -  Testing..|
00000030  68 65 61 64 20 31 20 2d  20 20 0d                 |head 1 -  .|

In 1987 Living Videotext was purchased by Symantec. Shortly after, Symantec released MORE II and a rebranded DOS application called GrandView based on ThinkTank.

Let’s take a look at GrandView. It was built from the DOS version of ThinkTank and was compatible with the same formats. It had great reviews at the time and provided the first outliner for Symantec. It was written by John Friend, the developer who created PC Outline, which was often bundled with WordStar.

GrandView could import and export into any of the other products.

GrandView version 1 went with a new file format.

% hexdump -C PROJECT.GV | head
00000000  0b 00 01 00 1a 4a 4c 46  5f 49 44 06 00 02 00 01  |.....JLF_ID.....|
00000010  01 0a 00 03 00 16 00 26  00 2c 00 05 00 05 00 78  |.......&.,.....x|
00000020  06 00 07 00 ff ff 06 00  1a 00 01 00 26 00 08 00  |............&...|
00000030  26 00 ee 4d 02 00 00 00  c0 a8 00 00 00 00 00 00  |&..M............|
00000040  01 00 00 00 00 00 01 00  00 00 00 00 00 00 00 01  |................|
00000050  00 00 22 00 0a 00 20 1c  00 00 20 1c 00 00 d0 b6  |.."... ... .....|
00000060  00 00 10 ef 00 00 20 1c  00 00 20 1c 00 00 60 35  |...... ... ...`5|
00000070  01 00 01 00 05 00 10 00  62 15 00 13 00 cc 93 88  |........b.......|
00000080  10 54 54 59 2e 50 44 56  00 00 00 00 00 00 2c 00  |.TTY.PDV......,.|
00000090  1c 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|

GrandView 2.0 also used the same format.

% hexdump -C TEST.GV | head
00000000  0b 00 01 00 1a 4a 4c 46  5f 49 44 06 00 02 00 01  |.....JLF_ID.....|
00000010  02 0a 00 03 00 02 00 2b  00 3a 00 05 00 05 00 78  |.......+.:.....x|
00000020  06 00 07 00 ff ff 26 00  08 00 2b 00 d5 3e 02 00  |......&...+..>..|
00000030  00 00 d0 b6 00 00 00 00  00 00 01 00 00 00 00 00  |................|
00000040  01 00 00 00 00 00 00 00  00 01 00 00 22 00 0a 00  |............"...|
00000050  20 1c 00 00 20 1c 00 00  d0 b6 00 00 10 ef 00 00  | ... ...........|
00000060  20 1c 00 00 20 1c 00 00  60 35 01 00 01 00 05 00  | ... ...`5......|
00000070  10 00 60 15 00 13 00 9b  5d 83 14 48 50 4c 33 2e  |..`.....]..HPL3.|
00000080  50 44 56 00 00 00 00 00  2c 00 1c 00 00 00 00 00  |PDV.....,.......|
00000090  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
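Both GrandView dumps open with the same eleven bytes, ending in the ASCII string JLF_ID. A small Python sketch of a check built on that observation follows. It is based on just these two samples, so treat the prefix as a working assumption rather than a confirmed signature; `looks_like_grandview` is a hypothetical helper name.

```python
# First 11 bytes shared by the GrandView 1 and 2.0 samples shown here.
# Assumption: other GrandView files begin the same way.
GRANDVIEW_PREFIX = bytes.fromhex("0b0001001a") + b"JLF_ID"


def looks_like_grandview(data: bytes) -> bool:
    """True when the data begins with the prefix seen in both samples."""
    return data.startswith(GRANDVIEW_PREFIX)
```

The byte at offset 0x10 is 01 in the version 1 sample and 02 in the 2.0 sample, which might turn out to be a version marker, but two files are not enough to say.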

GrandView was also compatible with the Macintosh counterpart, MORE.

Symantec then released a new version of the MORE software for the Macintosh in 1988, adding new presentation features. MORE II went away from the dot-head format and used a new proprietary format.

% hexdump -C MORE2-s01 | head
00000000  00 03 4d 52 49 49 00 80  00 00 00 80 00 00 00 78  |..MRII.........x|
00000010  00 00 00 00 00 00 00 00  00 00 00 f8 00 00 00 a8  |................|
00000020  00 00 01 a0 00 00 00 28  00 00 01 c8 00 00 00 18  |.......(........|
00000030  00 00 01 e0 00 00 00 00  00 00 01 e0 00 00 00 0c  |................|
00000040  00 00 01 ec 00 00 00 0c  00 00 01 e0 00 00 00 00  |................|
00000050  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00000080  00 03 00 00 00 48 00 48  00 00 00 00 02 d8 02 28  |.....H.H.......(|
00000090  ff e1 ff e2 02 f9 02 46  03 47 05 28 03 fc 00 02  |.......F.G.(....|
000000a0  00 00 00 48 00 48 00 00  00 00 02 d8 02 28 00 01  |...H.H.......(..|

Then in 1990 Symantec released MORE 3.0 with even more features and improvements to the user experience, along with a companion tool, MORE Graph.

% hexdump -C MORE3-s01 | head
00000000  00 06 4d 4f 52 33 00 80  00 00 00 80 00 00 00 78  |..MOR3.........x|
00000010  00 00 00 f8 00 00 01 b4  00 00 02 ac 00 00 00 a8  |................|
00000020  00 00 11 16 00 00 00 32  00 00 11 48 00 00 00 20  |.......2...H... |
00000030  00 00 11 68 00 00 00 00  00 00 11 68 00 00 00 10  |...h.......h....|
00000040  00 00 11 83 00 00 00 0c  00 00 11 68 00 00 00 00  |...........h....|
00000050  00 00 00 00 00 00 03 54  00 00 0d c2 00 00 11 78  |.......T.......x|
00000060  00 00 00 0b 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000070  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000080  00 03 00 00 00 48 00 48  00 00 00 00 02 d8 02 28  |.....H.H.......(|
00000090  ff e1 ff e2 02 f9 02 46  03 47 05 28 03 fc 00 02  |.......F.G.(....|

The MORE 3 format got a new header but appears similar in structure to the previous version. And the new companion tool MORE Graph had yet another format.

% hexdump -C MORE3-graph | head 
00000000  00 01 00 00 01 09 00 00  00 0c 00 01 09 19 80 00  |................|
00000010  01 09 54 65 73 74 00 00  00 07 00 01 00 03 00 00  |..Test..........|
00000020  00 0b 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000030  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00000610  00 00 00 00 00 00 00 01  06 47 65 6e 65 76 61 00  |.........Geneva.|
00000620  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00000670  01 00 00 1f ca 33 a0 00  2b 00 04 02 d8 03 d8 09  |.....3..+.......|
00000680  57 6f 72 6b 73 68 65 65  74 00 00 00 00 00 00 00  |Worksheet.......|

Luckily these early Macintosh-based formats didn’t use a resource fork, making them fully compatible with their PC counterparts.

One of the coolest parts of this long list of outline software is that years later, after Symantec discontinued the product, the original creator, Dave Winer, petitioned Symantec to allow him to release the antique software free and clear to the public. How cool is that? I would really like to see this happen more as other software titles die and get swept under the rug, leaving the community to try to find copies, preserve them and make sense of the formats. Not only were the early versions made available, a tool was built to migrate the MORE format to more open XML, allowing the ideas trapped in these ancient formats to be re-imagined.

MORE 3.1 was the final version of the software to be released by Symantec. The files produced by MORE 3.1 have an identical header to the standard 3.0 version, so we probably only need one signature for the two versions.

% hexdump -C MORE31-s01 | head 
00000000  00 06 4d 4f 52 33 00 80  00 00 00 80 00 00 00 78  |..MOR3.........x|
00000010  00 00 00 f8 00 00 01 b4  00 00 02 ac 00 00 00 a8  |................|
00000020  00 00 11 16 00 00 00 32  00 00 11 48 00 00 00 20  |.......2...H... |
00000030  00 00 11 68 00 00 00 00  00 00 11 68 00 00 00 10  |...h.......h....|
00000040  00 00 11 83 00 00 00 0c  00 00 11 68 00 00 00 00  |...........h....|
00000050  00 00 00 00 00 00 03 54  00 00 0d c2 00 00 11 78  |.......T.......x|
00000060  00 00 00 0b 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000070  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000080  00 03 00 00 00 48 00 48  00 00 00 00 02 d8 02 28  |.....H.H.......(|
00000090  ff e1 ff e2 02 f9 02 46  03 47 05 28 03 fc 00 02  |.......F.G.(....|
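The MORE II and MORE 3 dumps both carry a four-character ASCII tag at offset 2: MRII for MORE II and MOR3 for MORE 3.0 and 3.1. As a rough sketch, a Python check could key on that tag alone. The two bytes before the tag (00 03 and 00 06 in my samples) are left out on purpose, since I have too few files to know whether they are fixed; `identify_more` and the labels it returns are only illustrative.

```python
# ASCII tags observed at offset 2 in the sample files.
MORE_TAGS = {
    b"MRII": "MORE II",
    b"MOR3": "MORE 3.x",  # the 3.0 and 3.1 samples share this header
}


def identify_more(data: bytes):
    """Return a label for the MORE generation, or None if no tag matches."""
    return MORE_TAGS.get(data[2:6])
```

The earlier dot-head files and MORE Graph files would return None here, since neither carries one of these tags.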

If you would like to try out the MORE software, download this disk image, and drag onto the Macintosh emulator below. The image will automatically mount and you should be able to take MORE 3.1 for a spin!

Outlining software still has a good place in idea generation and presentation. OmniOutliner can probably trace its roots to these “Antique” titles! Stay tuned for some PRONOM signatures to go along with these many format examples. For now you can gather some of the samples from my Github page.

Textor

October 17, 2025 by Thor Leave a comment

Many of us lived through the Word Processing Wars of the late 1980’s and early 1990’s. It was an overwhelming time with many options to choose from, each adding new features with every update in an effort to become the leader in the word processing game. Early DOS versions had steep learning curves, which built loyalty among those who committed to muscle memory all the key commands needed to produce the perfect document. The many word processing options brought just as many file formats in which to save your work. Many titles used the same file extensions or encouraged users to choose their own, using their initials instead. Often the files created by these titles used standard ASCII text but mixed in their own formatting codes, all of which tends to make identification in preservation difficult.

I recently acquired a large lot of older software. It has been fun sorting through it and learning about the different titles. One title stuck out, as I hadn’t heard of it before. I found an old article which included the software in a comparison of word processing software in 1993. The article compares the following executive word processing software.

LotusWrite 2.0
JustWrite 2.0
Professional Write Plus 1.0
CA-Textor 6.0
Ami Pro 3.0
Word for Windows 2.0a
WordPerfect 5.1

You are probably familiar with a few of these titles, but the one that stuck out to me was CA-Textor 6.0. In my lot of software I came across a two-disk installer for CA-Textor 6.0 for Windows. It was developed by Computer Associates International, Inc., which opened its doors in 1976 and developed or acquired many software titles.

In the case of CA-Textor, it was purchased from a French company, Talor à Paris, which had been producing Textor, a popular word processor in France, for DOS since 1983. The original developer, Thierry Lorthiois, had high hopes for a French product to exist in a world of giant American companies. Even with over 70,000 copies sold, the release of Textor 4 in 1988 saw much market share lost to Microsoft Word. By 1989, Computer Associates had purchased Textor, rebranded Textor 5 for DOS, and added mouse compatibility; then in late 1991 it released a Windows version and named it CA-Textor, in line with its other products. It would be the only version released by Computer Associates, and it disappeared into the void like many word processors of the time.

CA-Textor 6.0 for Windows appears to be a well-designed word processor for its time. Reviews were mixed, but it scored decently in many comparisons. In the article mentioned above, however, it scored the lowest of all the word processors. The final result says:

CA-Textor fails to offer the usability shortcuts of the other programs, and scores well below the other programs in editing, formatting and graphics manipulation.

It was possibly reviews like this which caused Computer Associates to never update or release a new version of the software.

The first thing I noticed was the way the software handles files. It defaults to a new “Library” method, in which each file is connected to a Library that stores a folder of files along with their full names and descriptions.

Single files can still be saved from CA-Textor by choosing DOS file, but the extension used is not clear.

Using .TXT for a formatted file seems like a bad recommendation. So let’s take a look at a few of the files generated by CA-Textor.

The new Library File has the extension .TAL.

% hexdump -C TEXTOR.TAL | head
00000000  43 3a 5c 54 45 58 54 4f  52 5c 54 45 58 54 4f 52  |C:\TEXTOR\TEXTOR|
00000010  2e 54 41 4c 00 00 00 00  00 00 00 00 00 00 00 00  |.TAL............|
00000020  00 00 00 00 00 00 00 00  00 54 45 58 54 4f 52 00  |.........TEXTOR.|
00000030  00 00 c1 46 8d ec 1a 47  8d ec c1 46 8d ec 05 00  |...F...G...F....|
00000040  01 00 01 00 00 00 00 00  00 00 00 00 00 ff ff 54  |...............T|
00000050  42 58 54 66 00 0a 00 00  00 00 00 65 00 00 00 00  |BXTf.......e....|
00000060  00 00 00 00 00 54 65 73  74 00 00 00 00 00 00 00  |.....Test.......|
00000070  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000080  00 00 00 00 00 00 00 00  00 00 00 00 00 00 54 45  |..............TE|
00000090  58 54 4f 52 00 00 00 e9  46 8d ec e9 46 8d ec 01  |XTOR....F...F...|

The first few bytes are the path of the file.

It seems the individual files connected to the Library also have the .TAL extension, but they have a different header.

% hexdump -C OBSO0006.TAL | head
00000000  01 14 00 45 54 01 02 58  54 00 06 49 57 01 00 00  |...ET..XT..IW...|
00000010  00 00 00 65 00 87 16 06  80 00 4f 62 73 6f 6c 65  |...e......Obsole|
00000020  74 65 54 68 6f 72 00 00  00 00 00 00 00 00 00 00  |teThor..........|
00000030  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000040  00 00 00 4f 62 73 6f 6c  65 74 65 54 68 6f 72 00  |...ObsoleteThor.|
00000050  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000060  00 00 00 00 00 00 00 00  00 00 00 00 54 45 58 54  |............TEXT|
00000070  4f 52 00 00 00 f4 b0 8e  ec 1d b1 8e ec 00 00 00  |OR..............|
00000080  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000090  00 00 00 00 00 00 00 00  00 00 07 11 00 02 00 00  |................|

The CA-Textor software also installed some sample files that we can look at.

% hexdump -C SAMPLE01.SAM | head
00000000  01 14 00 45 54 01 02 58  54 00 06 49 57 3b 00 00  |...ET..XT..IW;..|
00000010  00 da 05 00 00 04 01 06  80 00 44 3a 5c 44 4f 43  |..........D:\DOC|
00000020  54 5c 53 41 4d 50 4c 45  30 31 2e 53 41 4d 00 00  |T\SAMPLE01.SAM..|
00000030  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000040  00 00 00 44 4f 53 00 00  00 00 00 00 00 00 00 00  |...DOS..........|
00000050  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000060  00 00 00 00 00 00 00 00  00 00 00 00 43 72 61 69  |............Crai|
00000070  67 00 00 00 00 34 24 b5  29 a1 8a b5 29 00 00 00  |g....4$.)...)...|
00000080  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000090  00 00 00 00 00 00 00 00  00 00 07 61 00 02 00 00  |...........a....|

A later build of CA-Textor had more sample files, but this time with a different extension. They also have the same bytes at the beginning of the file.

% hexdump -C TEMP0005.TEM | head
00000000  01 14 00 45 54 01 02 58  54 00 06 49 57 22 00 00  |...ET..XT..IW"..|
00000010  00 e0 40 12 00 10 00 06  80 00 46 61 78 20 43 6f  |..@.......Fax Co|
00000020  76 65 72 20 31 00 00 00  00 00 00 00 00 00 00 00  |ver 1...........|
00000030  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000040  00 00 00 54 65 6d 70 6c  61 74 65 73 00 00 00 00  |...Templates....|
00000050  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000060  00 00 00 00 00 00 00 00  00 00 00 00 54 45 58 54  |............TEXT|
00000070  4f 52 00 00 00 12 04 d7  2b 5d 80 e1 2b 00 00 00  |OR......+]..+...|
00000080  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000090  00 00 00 00 00 00 00 00  00 00 10 3d 02 01 00 00  |...........=....|

The good news is there is a pattern emerging, but not the same extension. I get the feeling they didn’t see much value in the extension for this software. When I save a file in the software as a DOS file, it doesn’t automatically pick an extension for me. I left the extension off and saved a file in the DOS format.

% hexdump -C TEST1 | head
00000000  01 14 00 45 54 01 02 58  54 00 06 49 57 1d 00 00  |...ET..XT..IW...|
00000010  00 2f 67 d8 6e 1d 00 06  80 00 43 3a 5c 54 45 58  |./g.n.....C:\TEX|
00000020  54 4f 52 5c 54 45 53 54  31 00 00 00 00 00 00 00  |TOR\TEST1.......|
00000030  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000040  00 00 00 44 4f 53 00 00  00 00 00 00 00 00 00 00  |...DOS..........|
00000050  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000060  00 00 00 00 00 00 00 00  00 00 00 00 54 45 58 54  |............TEXT|
00000070  4f 52 00 00 00 c0 ce d0  ae 99 53 8d ec 00 00 00  |OR........S.....|
00000080  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000090  00 00 00 00 00 00 00 00  00 00 07 0d 00 02 00 00  |................|

We see the same pattern at the head, but also a clear mention of DOS, just like the sample files included. Since I don’t have any earlier DOS versions to compare, I have to assume this is the same with at least Textor 5. I did find a mention of someone trying to convert their older Textor 5 documents to modern formats and they mention they are in the TAL format.
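To make the pattern concrete, here is a hedged Python sketch that checks for the 13 bytes shared by all four document samples and then pulls out the three text fields that line up across the dumps. The offsets 0x1A, 0x43 and 0x6C, the field sizes, and the field names (`name`, `library`, `user`) are my reading of the hex dumps above, not anything documented, and `textor_fields` is a hypothetical helper.

```python
# The 13 leading bytes common to the OBSO0006.TAL, SAMPLE01.SAM,
# TEMP0005.TEM and TEST1 samples.
TEXTOR_MAGIC = bytes.fromhex("01140045540102585400064957")


def textor_fields(data: bytes):
    """Return the guessed header fields, or None if the magic is absent."""
    if not data.startswith(TEXTOR_MAGIC):
        return None

    def cstr(offset: int, size: int) -> str:
        # Fields look NUL-padded; keep only the text before the first NUL.
        raw = data[offset:offset + size].split(b"\x00", 1)[0]
        return raw.decode("cp437", "replace")

    return {
        "name": cstr(0x1A, 41),     # document title or DOS path (assumed)
        "library": cstr(0x43, 41),  # library name, or "DOS" (assumed)
        "user": cstr(0x6C, 9),      # "TEXTOR" or a user name (assumed)
    }
```

Run against the TEMP0005.TEM sample, this layout would give the name "Fax Cover 1", the library "Templates" and the user "TEXTOR", which matches what the ASCII column shows.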

% sf OBSO0006.TAL 

filename : 'OBSO0006.TAL'
filesize : 915
modified : 2025-10-05T13:26:56-06:00
errors   : 
matches  :
  - ns      : 'pronom'
    id      : 'UNKNOWN'
    format  : 
    version : 
    mime    : 
    class   : 
    basis   : 
    warning : 'no match'

% python3 trid.py OBSO0006.TAL 
TrID - File Identifier v2.41 - (C) 2003-2025 By M.Pontello

File: OBSO0006.TAL
       Unknown!

The Textor format is not known to PRONOM via Siegfried and is also unknown to TrID, which now has a Python release! I did go ahead and add the signature to Wikidata, which can be used in Siegfried. If there is a need, we can submit it to PRONOM as well.

% sf OBSO0006.TAL              
---
siegfried   : 1.11.2
scandate    : 2025-10-05T15:24:44-06:00
signature   : default.sig
created     : 2025-03-01T15:28:08+11:00
identifiers : 
  - name    : 'pronom'
    details : 'DROID_SignatureFile_V120.xml; container-signature-20240715.xml'
  - name    : 'wikidata'
    details : 'wikidata-definitions-4.0.0 (2025-10-05, DROID_SignatureFile_V120.xml, container-signature-20240715.xml)'
---
filename : 'OBSO0006.TAL'
filesize : 915
modified : 2025-10-05T13:26:56-06:00
errors   : 
matches  :
  - ns      : 'pronom'
    id      : 'UNKNOWN'
    format  : 
    version : 
    mime    : 
    class   : 
    basis   : 
    warning : 'no match'
  - ns        : 'wikidata'
    id        : 'Q136442756'
    format    : 'Textor document'
    URI       : 'http://www.wikidata.org/entity/Q136442756'
    permalink : 'https://www.wikidata.org/w/index.php?oldid=2413044878&title=Q136442756'
    mime      : 
    basis     : 'extension match tal; byte match at 0, 13 (Wikidata reference is empty)'
    warning   :

There is also a software tool called WINCONV from MacDisk, meant for converting word processing formats to modern and Mac-compatible formats, which was available until recently. This software will convert Textor 2/3/4/5/6 files to a text file or RTF. It separates Textor 2/3 into one group and 4, 5, and 6 into another. Unfortunately, it doesn’t confirm any extensions that might be used.

I was able to find a copy of Textor 2.2.

It took me a few minutes to figure out some of the controls. Aside from being in French, it was a little different from other word processing software.

After a bit of playing around in the software and trying many of the functions, I saved out a few files. At first, all the files were placed into a pair of files, called “TEXTOR.TEX” and “TEXTOR.LIG”. Creating a new document and saving would just update these two files. They seem to function in the same way the library function works in the Windows 6.0 version.

% hexdump -C TEXTOR.TEX | head
00000000  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00000080  00 00 cd 00 60 00 05 00  00 00 54 45 00 00 00 00  |....`.....TE....|
00000090  00 00 00 00 00 00 54 45  58 54 4f 52 20 20 00 00  |......TEXTOR  ..|
000000a0  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
000000c0  00 00 cd 00 62 00 05 00  ff ff 54 45 00 00 00 00  |....b.....TE....|
000000d0  00 00 00 00 00 00 54 45  58 54 4f 52 20 20 00 00  |......TEXTOR  ..|
000000e0  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|

% hexdump -C TEXTOR.LIG | head 
00000000  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00002f80  ff ff 1f 54 65 73 74 69  6e 67 0d 0a 0d 20 20 20  |...Testing...   |
00002f90  20 20 20 20 20 20 20 20  20 20 20 20 20 20 20 20  |                |
*
00003000  ff ff 1f 54 68 69 73 20  69 73 20 61 20 54 65 73  |...This is a Tes|
00003010  74 20 6f 66 20 54 65 78  74 6f 72 20 56 65 72 73  |t of Textor Vers|
00003020  69 6f 6e 20 32 2e 32 0d  0a 0d 20 20 20 20 20 20  |ion 2.2...      |
00003030  20 20 20 20 20 20 20 20  20 20 20 20 20 20 20 20  |                |

It seems the text portion of my document was saved in the LIG file and additional data, probably some description and user names into the TEX file. I then stumbled on a setup executable in the same directory that gave me some options.

THE TEXT DATABASE WILL BE CREATED ON THE DISK IN DRIVE (B)B
F1 – CREATING A TEXT DATABASE >1000 DOCUMENTS INACCESSIBLE BY MS-DOS
F2 – CREATING A TEXT DATABASE MANAGED BY MS-DOS (1 file per document)

Ok, so the software has two options: one creates a database of text, which we discovered above, and the other sets the software to create one file per document. When I selected F2, I was greeted with an error, and it took me a minute to realize the first line required a disk to be in Drive B. Once I got it all configured I was able to save out a single file for a document.

% hexdump -C TEST02.BAT | head
00000000  1f 54 65 73 74 69 6e 67  20 32 6e 64 20 4f 70 74  |.Testing 2nd Opt|
00000010  69 6f 6e 0d 0a 1a 00 00  00 00 00 00 00 00 00 00  |ion.............|
00000020  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|

Not much to go on. The file is just plain ASCII, other than a single byte (1f) at the beginning and the new line bytes followed by a DOS end-of-file byte (0d 0a 1a) at the end. The BAT extension is a little unexpected, as we usually see it on batch scripts in DOS. Let’s try a more complex text document. More text, a tab, centering a line…..

% hexdump -C TXT223.BAT | head 
00000000  02 27 08 47 2d 49 2d 2d  2d 2d 2d 2d 2d 21 2d 2d  |.'.G-I-------!--|
00000010  2d 2d 2d 2d 2d 2d 2d 21  2d 2d 2d 2d 2d 2d 2d 2d  |-------!--------|
00000020  2d 21 2d 2d 2d 2d 2d 2d  2d 2d 2d 21 2d 2d 2d 2d  |-!---------!----|
00000030  2d 2d 2d 2d 2d 21 2d 2d  2d 2d 2d 2d 2d 2d 2d 21  |-----!---------!|
00000040  2d 2d 2d 2d 2d 2d 2d 2d  2d 21 2d 44 0d 0a 02 27  |---------!-D...'|
00000050  08 23 46 37 32 2c 30 30  2c 30 30 2c 30 30 2c 30  |.#F72,00,00,00,0|
00000060  30 2c 30 30 2c 34 30 2c  30 31 2c 30 31 2c 30 31  |0,00,40,01,01,01|
00000070  2c 30 32 2c 30 30 2c 37  32 2c 30 30 2c 30 30 2c  |,02,00,72,00,00,|
00000080  30 30 2c 30 30 2c 30 30  2c 30 32 2c 23 0d 0a 54  |00,00,00,02,#..T|
00000090  65 73 74 69 6e 67 20 73  6f 6d 65 20 6f 66 20 74  |esting some of t|

That gave me more to work with, but it is a bit of a mess. These files seem to be more like those of other early DOS word processing programs, which used ASCII but embedded their own formatting codes throughout, which only their software understood. This is why it is difficult to identify older WordStar or WordPerfect files.

This was a fun format to explore, I did learn a little French, but also had to dig deep to find the little information I was able to mention here. I would love to find a copy of Textor 4 or 5, which I believe are different than versions 2 & 3 and different than the Windows 6 version I have. There is one edition available on eBay currently, but seems to be the first version. If someone has the means in France this would be good to preserve. Feel free to look at the samples I made.

ACE

September 12, 2025 by Thor Leave a comment

Without divulging any youthful indiscretions, I recently was going back through some of my personal archives and came across a disc I burned around 2002 with some music stored on it. Normally I would find MP3 files, but in this case the file had an ACE extension. I remembered the format as an alternative to the common RAR or ZIP format often used to compress content for transporting (sharing) around the internet. I did what I normally do when something is compressed and reached for 7zip. But to my surprise, it threw an error.

% 7z l sample.ace 

Scanning the drive for archives:
1 file, 12501419 bytes (12 MiB)    

Listing archive: sample.ace


ERROR: sample.ace : Can not open the file as archive

7zip can usually handle most common archives, but a part of me remembered there were two versions of WinACE back in the day: Version 1, which was a free version, and Version 2, which was for paid users of WinACE. How do I know which version I have? That is the question I frequently find myself asking. The first step was to check the PRONOM registry.

% sf sample.ace 
---
siegfried   : 1.11.2
scandate    : 2025-09-11T09:01:25-06:00
signature   : default.sig
created     : 2025-03-01T15:28:08+11:00
identifiers : 
  - name    : 'pronom'
    details : 'DROID_SignatureFile_V120.xml; container-signature-20240715.xml'
---
filename : 'sample.ace'
filesize : 12501419
modified : 2025-09-11T09:04:36-06:00
errors   : 
matches  :
  - ns      : 'pronom'
    id      : 'UNKNOWN'
    format  : 
    version : 
    mime    : 
    class   : 
    basis   : 
    warning : 'no match'

Nope, this format is not known to PRONOM. Let’s try another tool.

% file sample.ace 
sample.ace: ACE archive data version 20, from Win/32, version 20 to extract, solid

Ok, so the file tool knows it is a version 2 ACE file and requires version 2 to extract. Good info from a file identification tool. Now let’s see what we can find to extract this file on MacOS. The website Winace.com is long gone, as this compression tool lost popularity and the final release was over 14 years ago. Looking at the website in the WaybackMachine, we can see some downloads available. One is UnACE for Mac OS X, which, upon further review, only works on the older PowerPC Macs. There is an open source version of unace for Linux, but it only supports version 1, the free version of the format.

Below is a screenshot of the DOS version of the ACE software. Created by Marcel Lemke.

It might be good to mention that WinRAR used to support the ACE format, but with WinACE support ending years ago and with some new vulnerabilities and folks using it for malware, support was dropped in 2019.

Luckily, I still have my PowerMac G5 lying around waiting for this very situation. After a quick install, unace was able to unarchive my music and I was able to listen to some of my favorite songs from 23 years ago. I still wanted to find a modern solution and later discovered there is a Python project which can read and extract both versions. Acefile is a pure Python, no-dependencies implementation of UnACE. I had a little issue installing it on an older Catalina laptop, but it worked well on later MacOS versions. Acefile has a few features that are helpful not only for extracting, but also for testing and dumping the headers of an ACE file. I did install WinACE in a Windows XP Virtual Machine to make a few samples; here is one of them.

% acefile-unace --test sample.ace 
success  test.tif
total 1 tested, 1 ok, 0 failed

The test feature works well to ensure the file is complete and can be extracted, but doesn’t give me much to go on for knowing the version. Let’s try dumping the header.

% acefile-unace --header sample.ace 
volume
    filename    sample.ace
    filesize    12501419
    headers     MAIN:1 FILE:1 RECOVERY:0 others:0
header
    hdr_crc     0x4900
    hdr_size    44
    hdr_type    0x00        MAIN
    hdr_flags   0x8100      V20FORMAT|SOLID
    magic       b'**ACE**'
    eversion    20          2.0
    cversion    20          2.0
    host        0x02        Win32
    volume      0
    datetime    0x5b2aae37  2025-09-10 21:49:46
    reserved1   c8 51 62 e3 5b 80 00 00
    advert      b''
    comment     b''
    reserved2   b'\x00e\x9c\xb1\xd8\x00\x03\n\x00\x00@\x08\x00test.'
header
    hdr_crc     0x3626
    hdr_size    39
    hdr_type    0x01        FILE32
    hdr_flags   0x8001      ADDSIZE|SOLID
    packsize    12501328
    origsize    25264236
    datetime    0x5b2aadcd  2025-09-10 21:46:26
    attribs     0x00000080  NORMAL
    crc32       0x9290955a
    comptype    0x02        blocked
    compqual    0x03        normal
    params      0x000a
    reserved1   0x4000
    filename    b'test.tif'
    comment     b''
    ntsecurity  b''
    reserved2   b''

This is very helpful. We can see the output shows the magic bytes, but also the e(xtraction)version and c(reating)version. We can also find this information in the open source unace technical documentation.

       2      HEAD_CRC      CRC16 over block up from HEAD_TYPE
       2      HEAD_SIZE     size of the block from HEAD_TYPE
                              up to the last byte of this block

       1      HEAD_TYPE     archive header type is 0
       2      HEAD_FLAGS    contains most important information about the
                            archive

                               bit  discription

                                0   0  (no ADDSIZE field)
                                1   presence of a main comment

                                9   SFX-archive
                                10  dictionary size limited to 256K
                                    (because of a junior SFX)
                                11  archive consists of multiple volumes
                                12  main header contains AV-string
                                13  recovery record present
                                14  archive is locked
                                15  archive is solid

       7      ACESIGN       fixed string: '**ACE**' serves to find the
                              archive header

       1      VER_EXTRACT   version needed to extract archive
       1      VER_CREATED   version used to create the archive
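
To make the HEAD_FLAGS table above a little more concrete, here is a minimal Python sketch that turns a flags value into names. The function name and the label strings are my own. Bits 2 through 8 are not shown in the excerpt, so they are left undecoded, except for bit 8, which I am inferring is V20FORMAT because the acefile output above prints 0x8100 as V20FORMAT|SOLID and bit 15 is the solid flag.

```python
# Bit positions copied from the HEAD_FLAGS excerpt above.
# Bit 8 is an inference from the acefile output (0x8100 = V20FORMAT|SOLID),
# not something the excerpt states.
MAIN_FLAGS = {
    1: "main comment",
    8: "V20FORMAT",
    9: "SFX archive",
    10: "dictionary limited to 256K",
    11: "multi-volume",
    12: "AV-string",
    13: "recovery record",
    14: "locked",
    15: "solid",
}


def decode_main_flags(flags):
    """Return the names of the bits set in a 16-bit main header flags value."""
    names = []
    for bit in range(16):
        if flags & (1 << bit):
            # Bits missing from the table are reported by number only.
            names.append(MAIN_FLAGS.get(bit, f"bit {bit} (not in excerpt)"))
    return names


print(decode_main_flags(0x8100))  # ['V20FORMAT', 'solid']
```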

I think we have enough to go on to create a signature. We just need to see what the 1-byte version numbers look like in an actual file.

% hexdump -C sample.ace | head
00000000  00 49 2c 00 00 00 81 2a  2a 41 43 45 2a 2a 14 14  |.I,....**ACE**..|
00000010  02 00 37 ae 2a 5b c8 51  62 e3 5b 80 00 00 00 65  |..7.*[.Qb.[....e|
00000020  9c b1 d8 00 03 0a 00 00  40 08 00 74 65 73 74 2e  |........@..test.|
00000030  26 36 27 00 01 01 80 50  c1 be 00 6c 80 81 01 cd  |&6'....P...l....|
00000040  ad 2a 5b 80 00 00 00 5a  95 90 92 02 03 0a 00 00  |.*[....Z........|
00000050  40 08 00 74 65 73 74 2e  74 69 66 28 25 a4 89 04  |@..test.tif(%...|
00000060  fa 43 b1 05 49 0c a3 76  8e 16 a9 2c 92 44 34 8c  |.C..I..v...,.D4.|
00000070  2c 12 e7 28 67 68 49 69  a7 92 4a 10 07 da 10 16  |,..(ghIi..J.....|
00000080  9c 16 4a 10 07 2b 9c ae  30 a9 50 c4 0a 69 51 a6  |..J..+..0.P..iQ.|
00000090  c9 64 a7 24 09 93 3d 81  26 31 a9 c2 68 32 c1 33  |.d.$..=.&1..h2.3|

As you can see above, our magic bytes **ACE** start at offset 7 (the eighth byte of the file) and take up seven bytes. The two bytes after them both have the hex value 14, which converts to decimal “20”. Let’s look at another:

% hexdump -C sample2.ace | head
00000000  61 67 31 00 00 00 90 2a  2a 41 43 45 2a 2a 0a 0c  |ag1....**ACE**..|
00000010  02 00 50 7c 31 26 d7 2b  c0 48 af 83 ce d9 16 2a  |..P|1&.+.H.....*|
00000020  55 4e 52 45 47 49 53 54  45 52 45 44 20 56 45 52  |UNREGISTERED VER|
00000030  53 49 4f 4e 2a 34 5f 24  00 01 01 80 00 00 00 00  |SION*4_$........|
00000040  35 00 00 00 3c 7c 31 26  10 00 00 00 ff ff ff ff  |5...<|1&........|
00000050  01 05 0a 00 2a 55 05 00  61 75 64 69 6f 45 72 23  |....*U..audioEr#|
00000060  00 01 01 80 00 00 00 00  35 00 00 00 3c 7c 31 26  |........5...<|1&|
00000070  10 00 00 00 ff ff ff ff  01 05 0a 00 2a 55 04 00  |............*U..|
00000080  42 49 54 53 98 14 24 00  01 01 80 00 00 00 00 35  |BITS..$........5|
00000090  00 00 00 3c 7c 31 26 10  00 00 00 ff ff ff ff 01  |...<|1&.........|

Hmm, now we have two different values. “0A” converts to decimal “10” and “0C” converts to decimal “12”. So we can infer this ACE file was created in version 1.2 and requires at least version 1.0 to extract. Let’s try another:

% hexdump -C sample3.ace | head   
00000000  c0 3f 2c 00 00 00 81 2a  2a 41 43 45 2a 2a 0a 14  |.?,....**ACE**..|
00000010  02 00 dc ad 2a 5b 23 52  89 e0 5b 80 00 00 00 65  |....*[#R..[....e|
00000020  9c b1 d8 00 03 0a 00 00  40 08 00 74 65 73 74 2e  |........@..test.|
00000030  92 f3 27 00 01 01 80 54  c3 be 00 6c 80 81 01 cd  |..'....T...l....|
00000040  ad 2a 5b 80 00 00 00 5a  95 90 92 01 03 0a 00 00  |.*[....Z........|
00000050  40 08 00 74 65 73 74 2e  74 69 66 28 25 a4 89 04  |@..test.tif(%...|
00000060  fa 43 b1 05 49 0c a3 76  8e 16 a9 2c 92 44 34 8c  |.C..I..v...,.D4.|
00000070  2c 12 e7 28 67 68 49 69  a7 92 4a 10 07 da 10 16  |,..(ghIi..J.....|
00000080  9c 16 4a 10 07 2b 9c ae  30 a9 50 c4 0a 69 51 a6  |..J..+..0.P..iQ.|
00000090  c9 64 a7 24 09 93 3d 81  26 31 a9 c2 68 32 c1 33  |.d.$..=.&1..h2.3|

Again we have “0A”, which converts to decimal “10”, followed by hex 14, which converts to decimal “20”. So this archive was made with version 2.0 of the software but kept compatible with version 1.0 for extraction. One more:

% hexdump -C sample4.ace | head
00000000  8b d6 31 00 00 00 90 2a  2a 41 43 45 2a 2a 0b 0b  |..1....**ACE**..|
00000010  02 00 cd b4 3e 26 4a e3  a1 80 32 4b c1 d9 16 2a  |....>&J...2K...*|
00000020  55 4e 52 45 47 49 53 54  45 52 45 44 20 56 45 52  |UNREGISTERED VER|
00000030  53 49 4f 4e 2a aa 08 24  00 01 01 00 00 00 00 00  |SION*..$........|
00000040  00 00 00 00 83 b2 3e 26  10 00 00 00 ff ff ff ff  |......>&........|
00000050  01 05 0a 00 2a 55 05 00  4d 75 73 69 63 77 73 27  |....*U..Musicws'|
00000060  00 01 01 00 00 00 00 00  00 00 00 00 83 b2 3e 26  |..............>&|
00000070  10 00 00 00 ff ff ff ff  01 05 0a 00 2a 55 08 00  |............*U..|
00000080  52 65 73 6f 75 72 63 65  93 75 25 00 01 01 00 00  |Resource.u%.....|
00000090  00 00 00 00 00 00 00 83  b2 3e 26 10 00 00 00 ff  |.........>&.....|

Both the extraction and creation versions are hex “0B”, which converts to decimal “11”. I would have assumed any 1.0 release could extract anything created with later 1.x versions, but I guess that might not be true. I am not clear on all the versions released, so I am not sure how many versions I should include in a signature. I did look through some of the captured pages on the Wayback Machine, and it appears the last 1.x version was 1.32.
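
The manual reading of the four dumps above can be sketched in a few lines of Python. This is only an illustration under the field layout quoted from the unace documentation (2-byte CRC, 2-byte size, 1-byte type, 2-byte flags, 7-byte magic, then the two version bytes); the function names are my own, and the hex strings are the first 16 bytes of each sample as shown above.

```python
import struct

# Start of the ACE main header, little-endian, no padding:
# hdr_crc (2), hdr_size (2), hdr_type (1), hdr_flags (2),
# magic (7), extract version (1), creation version (1) = 16 bytes.
MAIN_HEADER = struct.Struct("<HHBH7sBB")


def _fmt(version):
    # One byte per version: 20 reads as 2.0, 12 as 1.2, and so on.
    return f"{version // 10}.{version % 10}"


def read_ace_versions(data):
    """Return (extract, created) version strings, or None if not an ACE header."""
    if len(data) < MAIN_HEADER.size:
        return None
    _crc, _size, hdr_type, _flags, magic, ever, cver = MAIN_HEADER.unpack_from(data)
    if hdr_type != 0 or magic != b"**ACE**":
        return None
    return _fmt(ever), _fmt(cver)


samples = {
    "sample.ace": "00492c000000812a2a4143452a2a1414",
    "sample2.ace": "616731000000902a2a4143452a2a0a0c",
    "sample3.ace": "c03f2c000000812a2a4143452a2a0a14",
    "sample4.ace": "8bd631000000902a2a4143452a2a0b0b",
}
for name, hexbytes in samples.items():
    print(name, read_ace_versions(bytes.fromhex(hexbytes)))
```

For the four samples this gives extraction/creation pairs of 2.0/2.0, 1.0/1.2, 1.0/2.0 and 1.1/1.1, matching what we read by eye.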

When building these signatures, it should be easy to create two, one for each major extraction version. But should the creation version be a factor? A version 1.x signature could look like this:

2A2A4143452A2A(0A|0B|0C|0D)(0A|0B|0C|0D|14)

This accounts for versions 1.0 through 1.3 as the extraction version and 1.0 through 2.0 as the creation version. Version 2.0 doesn’t seem to indicate minor versions, with all 2.0 releases using hex 14 (decimal 20). So a signature could be:

2A2A4143452A2A1414

Both would be anchored at offset 7 from the beginning of the file. Is there a better solution?
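
To try the two proposed patterns against sample bytes, they can be translated into Python byte regexes. This is a rough sketch rather than how an identification tool would implement it; the labels "ACE 1.x" and "ACE 2.0" and the function name are placeholders of mine.

```python
import re

# The magic sits after hdr_crc, hdr_size, hdr_type and hdr_flags (7 bytes).
MAGIC_OFFSET = 7

# 2A2A4143452A2A(0A|0B|0C|0D)(0A|0B|0C|0D|14)
ACE_V1 = re.compile(rb"\*\*ACE\*\*[\x0a-\x0d][\x0a-\x0d\x14]")
# 2A2A4143452A2A1414
ACE_V2 = re.compile(rb"\*\*ACE\*\*\x14\x14")


def identify_ace(data):
    """Return a placeholder label for the first signature matching at offset 7."""
    if ACE_V1.match(data, MAGIC_OFFSET):
        return "ACE 1.x"
    if ACE_V2.match(data, MAGIC_OFFSET):
        return "ACE 2.0"
    return None


# First 16 bytes of sample.ace and sample3.ace from the dumps above.
print(identify_ace(bytes.fromhex("00492c000000812a2a4143452a2a1414")))  # ACE 2.0
print(identify_ace(bytes.fromhex("c03f2c000000812a2a4143452a2a0a14")))  # ACE 1.x
```

Note that sample3.ace, created with 2.0 but extractable with 1.0, lands in the 1.x group because the split is made on the extraction version.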

I will warn you, there are a couple of other ACE formats out there which you may come across. One is an image/texture format for Microsoft Train Simulator, which might be for another day. There is another use of the ACE archive worth discussing: the Comic Book Archive file with the extension CBA uses an ACE archive to store a series of images for some comic book readers. These are indeed ACE archive files, only with a different extension and a specific purpose. Maybe adding the CBA extension to the signature would be sufficient?

I am sure there are other properties of the ACE format seen above that we could discuss, such as encryption, the differences between Solid and SFX, and dictionary headers, but for now identification of the format and the main version difference is sufficient. Check out my GitHub page for my signature proposal and a few samples I made.

Page Perfect

August 22, 2025 by Thor

PagePerfect: the Promise of Desktop Publishing Realized

Now, PagePerfect has arrived. And suddenly PC desktop publishing is a lot
simpler and less expensive, because PagePerfect integrates desktop
publishing, word processing, and graphics editing all in one package.

The 1980’s were a time of growth in personal computing, and one industry in particular was progressing rapidly. Previously, to print more than just words, you had to use a complex arrangement of type, masking, and screening, all done by hand. Now with a personal computer you could design and print well-designed layouts. Many software applications came on the scene in these early days. My personal favorite was QuarkXPress; I started using it in the early 1990’s and spent the next few years working with it in a commercial printshop. What once took a team of skilled workers to set copy, mask, blueline, etc. took only one person with the right software.

I recently came across a set of floppy disks for some software called PagePerfect, by the well-known software company IMSI.

This article from a 1988 issue of PC Magazine announces the revolutionary new software. This was early in the days of desktop publishing, and even on a DOS system the software was powerful. It didn’t always get the best reviews in terms of ease of use, but it was well built. The company behind it wasn’t IMSI, as you might expect. It was programmed by a different company, Beyond Words, started by three former employees of MicroPro, the makers of WordStar. Beyond Words liked to “leave sales to others”, which included IMSI and a big contract with Canon for what was called their Desktop Publishing System.

IMSI marketed the software well and priced it well. The name PagePerfect didn’t last long; in 1989 they renamed the software IMSI Publisher. I’m not 100% sure, but it might have to do with WordPerfect asserting some claim to the name around that same time. By 1990, the software was not seen much anymore, but another name pops up: Beyond Words Composer 2.0.

All three versions of the software have a very similar interface.

But the one thing they all have in common is their file formats. Unfortunately, they used the same extensions many word processors used during this time and after: .DOC and also .STY, which Microsoft Word used frequently as well. It makes sense, as Document shortens to DOC and Stylesheet shortens to STY. So if you have any DOC files which don’t open in Word, you might look here. The other problem is that the format is not plain text; it is a proprietary binary format.

hexdump -C TEST.DOC | head
00000000  5b 42 57 44 42 5d 00 00  00 00 00 31 2e 30 30 00  |[BWDB].....1.00.|
00000010  00 00 00 00 00 00 3c af  13 5b 1e 00 00 00 95 63  |......<..[.....c|
00000020  00 00 5e 00 00 00 18 00  00 00 01 00 76 00 00 00  |..^.........v...|
00000030  68 01 00 00 0a 00 de 01  00 00 00 00 00 00 00 00  |h...............|
00000040  de 01 00 00 8b 60 00 00  1e 00 69 62 00 00 2c 01  |.....`....ib..,.|
00000050  00 00 1e 00 00 00 00 00  00 00 00 00 00 00 5b 42  |..............[B|
00000060  57 44 4f 43 5d 00 00 00  00 32 2e 30 39 00 00 00  |WDOC]....2.09...|
00000070  00 00 00 00 0a 00 00 00  00 00 00 00 00 00 00 00  |................|
00000080  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000090  00 00 00 00 6c 00 00 00  00 00 00 00 00 00 00 00  |....l...........|

The one positive is the very obvious strings of text in the header, [BWDB] and [BWDOC], which one could read as Beyond Words DB and Beyond Words Document. A later Beyond Words Composer document has the same header but a higher version number.

hexdump -C WELCOME.DOC | head
00000000  5b 42 57 44 42 5d 00 00  00 00 00 31 2e 30 30 00  |[BWDB].....1.00.|
00000010  00 00 00 00 00 00 aa 14  56 16 29 00 00 00 30 84  |........V.)...0.|
00000020  00 00 5e 00 00 00 18 00  00 00 01 00 76 00 00 00  |..^.........v...|
00000030  b0 01 00 00 0c 00 26 02  00 00 00 00 00 00 00 00  |......&.........|
00000040  26 02 00 00 70 80 00 00  29 00 96 82 00 00 9a 01  |&...p...).......|
00000050  00 00 29 00 00 00 00 00  00 00 00 00 00 00 5b 42  |..)...........[B|
00000060  57 44 4f 43 5d 00 00 00  00 33 2e 30 31 00 00 00  |WDOC]....3.01...|
00000070  00 00 00 00 0c 00 00 00  00 00 00 00 00 00 00 00  |................|
00000080  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000090  00 00 00 00 6e 00 00 00  00 00 00 00 00 00 00 00  |....n...........|

If we look at the Stylesheets we see the same patterns.

hexdump -C SAMPLE.STY | head   
00000000  5b 42 57 44 42 5d 00 00  00 00 00 31 2e 30 30 00  |[BWDB].....1.00.|
00000010  00 00 00 00 00 00 51 10  76 10 09 00 00 00 da 2c  |......Q.v......,|
00000020  00 00 5e 00 00 00 18 00  00 00 01 00 76 00 00 00  |..^.........v...|
00000030  68 01 00 00 0a 00 de 01  00 00 00 00 00 00 00 00  |h...............|
00000040  de 01 00 00 a2 2a 00 00  09 00 80 2c 00 00 5a 00  |.....*.....,..Z.|
00000050  00 00 09 00 00 00 00 00  00 00 00 00 00 00 5b 42  |..............[B|
00000060  57 44 4f 43 5d 00 00 00  00 32 2e 30 39 00 00 00  |WDOC]....2.09...|
00000070  00 00 00 00 0a 00 00 00  00 00 00 00 00 00 00 00  |................|
00000080  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000090  00 00 00 00 6c 00 00 00  00 00 00 00 00 00 00 00  |....l...........|

I haven’t been able to find any specific bytes which differentiate the Stylesheets from the Documents. They may well be the same format, so for now we will treat them as one. The stylesheets seem to function as templates, and templates often share a format with the documents made from them.
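
The pattern in these dumps can be checked with a short Python sketch: confirm [BWDB] at the start of the file, locate [BWDOC], and read the version string that follows each marker. The function names are hypothetical, and reading the first string as a container version and the second as a document version is my assumption. The sketch searches for [BWDOC] rather than hard-coding its position, since I can't be sure the marker always sits at the same offset. The hex below is the first 112 bytes of TEST.DOC as dumped above.

```python
def _version_after(data, pos):
    # Skip the NUL padding after a marker, then read up to the next NUL.
    while pos < len(data) and data[pos] == 0:
        pos += 1
    end = data.find(b"\x00", pos)
    if end < 0:
        end = len(data)
    return data[pos:end].decode("ascii", errors="replace")


def read_bw_header(data):
    """Return (container_version, document_version), or None if no match."""
    if not data.startswith(b"[BWDB]"):
        return None
    doc_at = data.find(b"[BWDOC]")
    if doc_at < 0:
        return None
    return (_version_after(data, len(b"[BWDB]")),
            _version_after(data, doc_at + len(b"[BWDOC]")))


test_doc = bytes.fromhex(
    "5b425744425d0000000000312e303000"
    "0000000000003caf135b1e0000009563"
    "00005e00000018000000010076000000"
    "680100000a00de010000000000000000"
    "de0100008b6000001e00696200002c01"
    "00001e00000000000000000000005b42"
    "57444f435d00000000322e3039000000"
)
print(read_bw_header(test_doc))  # ('1.00', '2.09')
```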

Apart from the document layout, the software can also create and use databases, which appear to be in a similar format but with different offsets.

hexdump -C DOCUMENT.TBL | head
00000000  5b 42 57 44 42 5d 00 00  00 00 00 31 2e 30 30 00  |[BWDB].....1.00.|
00000010  00 00 00 00 00 00 6b 10  36 00 00 00 18 00 00 00  |......k.6.......|
00000020  01 00 4e 00 00 00 68 01  00 00 0a 00 b6 01 00 00  |..N...h.........|
00000030  00 00 00 00 00 00 5b 42  57 44 4f 43 5d 00 00 00  |......[BWDOC]...|
00000040  00 32 2e 30 39 00 00 00  00 00 00 00 0a 00 00 00  |.2.09...........|
00000050  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000060  00 00 00 00 00 00 00 00  00 00 00 00 6c 00 00 00  |............l...|
00000070  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|

Before I dove into this format, the only tool with any information on it was TrID, which identified all the DOC and STY files as Beyond Words Composer style, which is mostly true. Hopefully with this background you can be aware of the different software names this format was used with and, with some luck, convert the files to something less proprietary.

Some disks that came with my PagePerfect install disks do have personal documents created with the software, but I wonder how much it was really used in the late 1980’s and early 1990’s, because after that point you don’t hear about it anymore. There are some references to the software being absorbed into another product, IBM DisplayWrite 5/2. I would be curious if others have come across this file format.