Textor

Many of us lived through the Word Processing Wars of the late 1980’s and early 1990’s. It was an overwhelming time of many options to choose from, each providing new features with each update, trying to become the leader in the word processing game. Early DOS versions had steep learning curves which built loyalty to those who committed to muscle memory all the key commands needed to produce the perfect document. With the many options to choose for word processing, brought just as many file formats to save your work. Many titles used the same file extensions or encouraged users to choose their own, using their initials instead. Often the files created by these software titles, used standard ASCII text, but mixed in their own formatting codes which all tend to make identification in preservation difficult.

I recently acquired a large lot of older software. It has been fun sorting through it and learning about the different titles. One title stuck out, as I hadn’t heard of it before. I found an old article which included the software in a comparison of word processing software in 1993. The article compares the following executive word processing software.

  • LotusWrite 2.0
  • JustWrite 2.0
  • Professional Write Plus 1.0
  • CA-Textor 6.0
  • Ami Pro 3.0
  • Word for Windows 2.0a
  • WordPerfect 5.1

You are probably familiar with a few of these titles, but the one that stuck out to me was CA-Textor 6.0. In my lot of software I came across a two disk installer for CA-Textor 6.0 for Windows. Developed by Computer Associates International, Inc. who opened their doors in 1976 and developed or acquired many software titles.

In the case of CA-Textor, it was purchased from a French company, Talor à Paris, who had been producing Textor, a popular word processor in France, for DOS since the 1983. The original developer, Thierry Lorthiois, had high hopes for a French product to exist in a world of giant American companies. Even with over 70,000 copies sold, the release of Textor 4 in 1988 saw much marketshare lost to Microsoft Word. By 1989, Computer Associates purchased Textor and rebranded Textor 5 for DOS and added mouse compatibility, then in late 1991 released a Windows version of Textor and named it CA-Textor; in line with their other products. It would be the only version released by Computer Associates and disappeared into the void like many word processors of the time.

CA-Textor 6.0 for Windows appears to be a well designed word process for its time. The reviews were mixed, but scored decently in many comparisons. In the article mentioned above, it scored the lowest of all the word processors. The final result says:

CA-Textor fails to offer the usability shortcuts of the other programs, and scores well below the other programs in editing, formatting and graphics manipulation.

It was possibly reviews like this which caused Computer Associates to never update or release a new version of the software.

The first thing I noticed with the software was the way the software handles files. The software defaults to a new “Library” method which stores each file connected to a Library which stores a folder of files and their full names and descriptions.

Single files can still be saved from CA-Textor by choosing DOS file, but the extension used is not clear.

Using .TXT for a formatted file seems like a bad recommendation. So let’s take a look at a few of the files generated by by CA-Textor.

The new Library File has the extension .TAL.

 % hexdump -C TEXTOR.TAL | head
00000000 43 3a 5c 54 45 58 54 4f 52 5c 54 45 58 54 4f 52 |C:\TEXTOR\TEXTOR|
00000010 2e 54 41 4c 00 00 00 00 00 00 00 00 00 00 00 00 |.TAL............|
00000020 00 00 00 00 00 00 00 00 00 54 45 58 54 4f 52 00 |.........TEXTOR.|
00000030 00 00 c1 46 8d ec 1a 47 8d ec c1 46 8d ec 05 00 |...F...G...F....|
00000040 01 00 01 00 00 00 00 00 00 00 00 00 00 ff ff 54 |...............T|
00000050 42 58 54 66 00 0a 00 00 00 00 00 65 00 00 00 00 |BXTf.......e....|
00000060 00 00 00 00 00 54 65 73 74 00 00 00 00 00 00 00 |.....Test.......|
00000070 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 |................|
00000080 00 00 00 00 00 00 00 00 00 00 00 00 00 00 54 45 |..............TE|
00000090 58 54 4f 52 00 00 00 e9 46 8d ec e9 46 8d ec 01 |XTOR....F...F...|

The first few bytes is the path of the file.

It also seems the individual files connected to the Library also have the .TAL extension. But they have a different header.

% hexdump -C OBSO0006.TAL | head
00000000 01 14 00 45 54 01 02 58 54 00 06 49 57 01 00 00 |...ET..XT..IW...|
00000010 00 00 00 65 00 87 16 06 80 00 4f 62 73 6f 6c 65 |...e......Obsole|
00000020 74 65 54 68 6f 72 00 00 00 00 00 00 00 00 00 00 |teThor..........|
00000030 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 |................|
00000040 00 00 00 4f 62 73 6f 6c 65 74 65 54 68 6f 72 00 |...ObsoleteThor.|
00000050 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 |................|
00000060 00 00 00 00 00 00 00 00 00 00 00 00 54 45 58 54 |............TEXT|
00000070 4f 52 00 00 00 f4 b0 8e ec 1d b1 8e ec 00 00 00 |OR..............|
00000080 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 |................|
00000090 00 00 00 00 00 00 00 00 00 00 07 11 00 02 00 00 |................|

The CA-Textor software installed some sample files during installation we can also look at.

% hexdump -C SAMPLE01.SAM | head
00000000 01 14 00 45 54 01 02 58 54 00 06 49 57 3b 00 00 |...ET..XT..IW;..|
00000010 00 da 05 00 00 04 01 06 80 00 44 3a 5c 44 4f 43 |..........D:\DOC|
00000020 54 5c 53 41 4d 50 4c 45 30 31 2e 53 41 4d 00 00 |T\SAMPLE01.SAM..|
00000030 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 |................|
00000040 00 00 00 44 4f 53 00 00 00 00 00 00 00 00 00 00 |...DOS..........|
00000050 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 |................|
00000060 00 00 00 00 00 00 00 00 00 00 00 00 43 72 61 69 |............Crai|
00000070 67 00 00 00 00 34 24 b5 29 a1 8a b5 29 00 00 00 |g....4$.)...)...|
00000080 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 |................|
00000090 00 00 00 00 00 00 00 00 00 00 07 61 00 02 00 00 |...........a....|

A later build of CA-Textor had more sample files, but this time with a different extension. Also has the same bytes at the beginning of the file.

% hexdump -C TEMP0005.TEM | head
00000000 01 14 00 45 54 01 02 58 54 00 06 49 57 22 00 00 |...ET..XT..IW"..|
00000010 00 e0 40 12 00 10 00 06 80 00 46 61 78 20 43 6f |..@.......Fax Co|
00000020 76 65 72 20 31 00 00 00 00 00 00 00 00 00 00 00 |ver 1...........|
00000030 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 |................|
00000040 00 00 00 54 65 6d 70 6c 61 74 65 73 00 00 00 00 |...Templates....|
00000050 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 |................|
00000060 00 00 00 00 00 00 00 00 00 00 00 00 54 45 58 54 |............TEXT|
00000070 4f 52 00 00 00 12 04 d7 2b 5d 80 e1 2b 00 00 00 |OR......+]..+...|
00000080 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 |................|
00000090 00 00 00 00 00 00 00 00 00 00 10 3d 02 01 00 00 |...........=....|

The good news is there is a pattern emerging, but not the same extension. I get the feeling they didn’t see much value in the extension for this software. When I save a file in the software as a DOS file, it doesn’t automatically pick an extension for me. I left the extension off and saved a file in the DOS format.

% hexdump -C TEST1 | head
00000000 01 14 00 45 54 01 02 58 54 00 06 49 57 1d 00 00 |...ET..XT..IW...|
00000010 00 2f 67 d8 6e 1d 00 06 80 00 43 3a 5c 54 45 58 |./g.n.....C:\TEX|
00000020 54 4f 52 5c 54 45 53 54 31 00 00 00 00 00 00 00 |TOR\TEST1.......|
00000030 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 |................|
00000040 00 00 00 44 4f 53 00 00 00 00 00 00 00 00 00 00 |...DOS..........|
00000050 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 |................|
00000060 00 00 00 00 00 00 00 00 00 00 00 00 54 45 58 54 |............TEXT|
00000070 4f 52 00 00 00 c0 ce d0 ae 99 53 8d ec 00 00 00 |OR........S.....|
00000080 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 |................|
00000090 00 00 00 00 00 00 00 00 00 00 07 0d 00 02 00 00 |................|

We see the same pattern at the head, but also a clear mention of DOS, just like the sample files included. Since I don’t have any earlier DOS versions to compare, I have to assume this is the same with at least Textor 5. I did find a mention of someone trying to convert their older Textor 5 documents to modern formats and they mention they are in the TAL format.

% sf OBSO0006.TAL 

filename : 'OBSO0006.TAL'
filesize : 915
modified : 2025-10-05T13:26:56-06:00
errors :
matches :
- ns : 'pronom'
id : 'UNKNOWN'
format :
version :
mime :
class :
basis :
warning : 'no match'

% python3 trid.py OBSO0006.TAL
TrID - File Identifier v2.41 - (C) 2003-2025 By M.Pontello

File: OBSO0006.TAL
Unknown!

The Textor format is not known to PRONOM via Siegfried and also unknown to TrID, which now has a python release! I did go ahead and add the signature to Wikidata which can be used in Siegfried. If there is a need, we can submit to PRONOM as well.

% sf OBSO0006.TAL              
---
siegfried : 1.11.2
scandate : 2025-10-05T15:24:44-06:00
signature : default.sig
created : 2025-03-01T15:28:08+11:00
identifiers :
- name : 'pronom'
details : 'DROID_SignatureFile_V120.xml; container-signature-20240715.xml'
- name : 'wikidata'
details : 'wikidata-definitions-4.0.0 (2025-10-05, DROID_SignatureFile_V120.xml, container-signature-20240715.xml)'
---
filename : 'OBSO0006.TAL'
filesize : 915
modified : 2025-10-05T13:26:56-06:00
errors :
matches :
- ns : 'pronom'
id : 'UNKNOWN'
format :
version :
mime :
class :
basis :
warning : 'no match'
- ns : 'wikidata'
id : 'Q136442756'
format : 'Textor document'
URI : 'http://www.wikidata.org/entity/Q136442756'
permalink : 'https://www.wikidata.org/w/index.php?oldid=2413044878&title=Q136442756'
mime :
basis : 'extension match tal; byte match at 0, 13 (Wikidata reference is empty)'
warning :

There is also a software tool, meant for converting Word Processing formats to modern and Mac compatible formats which was available until recently called WINCONV from MacDisk. This software will convert Textor 2/3/4/5/6 files to a text file for RTF. In the software it separates Textor 2/3 into their own group and 4, 5, and 6 into their own. Unfortunately doesn’t confirm any extensions that might be used.

I was able to find a copy of Textor 2.2.

It took me a few minutes to figure out some of the controls. Aside being in French, it was a little different than other Word Processing software.

After a bit of playing around in the software and trying many of the functions, I saved out a few files. At first, all the files were placed into a pair of files, called “TEXTOR.TEX” and “TEXTOR.LIG”. Creating a new document and saving would just update these two files. They seem to function in the same way the library function works in the Windows 6.0 version.

% hexdump -C TEXTOR.TEX | head
00000000 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 |................|
*
00000080 00 00 cd 00 60 00 05 00 00 00 54 45 00 00 00 00 |....`.....TE....|
00000090 00 00 00 00 00 00 54 45 58 54 4f 52 20 20 00 00 |......TEXTOR ..|
000000a0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 |................|
*
000000c0 00 00 cd 00 62 00 05 00 ff ff 54 45 00 00 00 00 |....b.....TE....|
000000d0 00 00 00 00 00 00 54 45 58 54 4f 52 20 20 00 00 |......TEXTOR ..|
000000e0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 |................|

% hexdump -C TEXTOR.LIG | head
00000000 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 |................|
*
00002f80 ff ff 1f 54 65 73 74 69 6e 67 0d 0a 0d 20 20 20 |...Testing... |
00002f90 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 | |
*
00003000 ff ff 1f 54 68 69 73 20 69 73 20 61 20 54 65 73 |...This is a Tes|
00003010 74 20 6f 66 20 54 65 78 74 6f 72 20 56 65 72 73 |t of Textor Vers|
00003020 69 6f 6e 20 32 2e 32 0d 0a 0d 20 20 20 20 20 20 |ion 2.2... |
00003030 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 |

It seems the text portion of my document was saved in the LIG file and additional data, probably some description and user names into the TEX file. I then stumbled on a setup executable in the same directory that gave me some options.

THE TEXT DATABASE WILL BE CREATED ON THE DISK IN DRIVE (B)B
F1 – CREATING A TEXT DATABASE >1000 DOCUMENTS INACCESSIBLE BY MS-DOS
F2 – CREATING A TEXT DATABASE MANAGED BY MS-DOS (1 file per document)

Ok, so the software has two options. One for creating a database of text which we discovered above, and setting the software to create one file per document. When I selected F2, I was greeted with an error, which took me a minute to realize the first line required a disk to be in Drive B. Once I got it all configured I was able to save out a single file for a document.

% hexdump -C TEST02.BAT | head
00000000 1f 54 65 73 74 69 6e 67 20 32 6e 64 20 4f 70 74 |.Testing 2nd Opt|
00000010 69 6f 6e 0d 0a 1a 00 00 00 00 00 00 00 00 00 00 |ion.............|
00000020 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 |................|

Not much to go on, the file is just full of plain ASCII other than a simple byte at the beginning and some new line bytes at the end. The BAT extension is a little unexpected, usually see those as batch scripts in DOS. Let’s try a more complex text document. More text, a tab, centering a line…..

% hexdump -C TXT223.BAT | head 
00000000 02 27 08 47 2d 49 2d 2d 2d 2d 2d 2d 2d 21 2d 2d |.'.G-I-------!--|
00000010 2d 2d 2d 2d 2d 2d 2d 21 2d 2d 2d 2d 2d 2d 2d 2d |-------!--------|
00000020 2d 21 2d 2d 2d 2d 2d 2d 2d 2d 2d 21 2d 2d 2d 2d |-!---------!----|
00000030 2d 2d 2d 2d 2d 21 2d 2d 2d 2d 2d 2d 2d 2d 2d 21 |-----!---------!|
00000040 2d 2d 2d 2d 2d 2d 2d 2d 2d 21 2d 44 0d 0a 02 27 |---------!-D...'|
00000050 08 23 46 37 32 2c 30 30 2c 30 30 2c 30 30 2c 30 |.#F72,00,00,00,0|
00000060 30 2c 30 30 2c 34 30 2c 30 31 2c 30 31 2c 30 31 |0,00,40,01,01,01|
00000070 2c 30 32 2c 30 30 2c 37 32 2c 30 30 2c 30 30 2c |,02,00,72,00,00,|
00000080 30 30 2c 30 30 2c 30 30 2c 30 32 2c 23 0d 0a 54 |00,00,00,02,#..T|
00000090 65 73 74 69 6e 67 20 73 6f 6d 65 20 6f 66 20 74 |esting some of t|

That gave me more to work with. But a bit of a mess. These seem to be more like some of the other earlier DOS word processing programs, they used ASCII, but embedded their own formatting codes throughout which only their software understood. This is why it is difficult to identify older WordStar or WordPerfect files.

This was a fun format to explore, I did learn a little French, but also had to dig deep to find the little information I was able to mention here. I would love to find a copy of Textor 4 or 5, which I believe are different than versions 2 & 3 and different than the Windows 6 version I have. There is one edition available on eBay currently, but seems to be the first version. If someone has the means in France this would be good to preserve. Feel free to look at the samples I made.

FASTA & FASTQ

There seems to be a never ending source of file formats out there. Documenting past obsolete formats, one would assume a point at which there are no more to find, but in reality more are re-discovered everyday by the Digital Preservation community. When it comes to more modern formats, it seems more are invented everyday, too many to keep up with identification. Document one, 10 more pop up, it seems never-ending. Such is the case for scientific formats, including sequencing formats.

I was speaking with a colleague from another institution the other day and a file format was mentioned I hadn’t heard of before. It seems many of their scientific data was stored in a format called FASTA “Fast A” (“fast-aye”). This format specifically stores DNA sequence data and is used quite a bit, it seems. I was even more surprised the next day when I went to process some new submissions for our repository only to find one submission contained three FASTA files. I love researching file formats, but sometimes in order to understand the format structure you have to know something about the content as well. Let’s explore the FASTA and FASTQ file formats. If you would like to take a peek at the Human Genome in FASTA, go here.

Both the FASTA and FASTQ formats are text based and have a simple structure. Identification of each of these should be pretty simple, but to avoid conflicts with other formats, the signature might have to be more complex.

The FASTA format is well documented as many in the scientific community use it. Basically the format starts with the greater than “>” character followed by a description, a new line character, then the sequence. For example:

>MCHU - Calmodulin - Human, rabbit, bovine, rat, and chicken
MADQLTEEQIAEFKEAFSLFDKDGDGTITTKELGTVMRSLGQNPTEAELQDMINEVDADGNGTID
FPEFLTMMARKMKDTDSEEEIREAFRVFDKDGNGYISAAELRHVMTNLGEKLTDEEVDEMIREA
DIDGDGQVNYEEFVQMMTAK*

Pretty straight forward, but so much of the format can be variable, a simple signature would clash with too many other formats. There are some rules with what characters can be used in the sequence so it might be possible to limit the signature to only allow certain characters. At first I thought it might only be able to contain the standard characters representative of adenine (A), cytosine (C), guanine (G), and thymine (T), but as it turns out the FASTA format can contain Nucleic Acid Code’s and Amino Acid Code’s. These codes allow more than the four I was expecting, but do limit what can be represented.

Take the NCBI Sequence Viewer for a spin and download some data as FASTA.

The FASTQ format adds more structure and is more limiting, but also presents some challenges. Here is a sample of its structure:

@SEQ_ID
GATTTGGGGTTCAAAGCAGTATCGATCAAATAGTAAATCCATTTGTTCAACTCACAGTTT
+
!''*((((***+))%%%++)(%%%%).1***-+*''))**55CCF>>>>>>CCCCCCC65

Instead of a greater than symbol, the FASTQ format uses an “@” symbol followed by an identifier. The identifier can be basically anything and as long as needed. There is a newline character followed by the DNA sequence, which is only the four characters I have heard before. It can contain an A, C, G, T, or N. The “N” can represent an unidentified nucleotide or indicate that the software was unable to make a basecall. A newline character again and the “+” symbol. This is place before the fourth line with is a quality score and is the same number of characters as the sequence.

See what I mean when you have to learn about the context of a format in order to make a proper signature!

One of the problems I am left with is how to determine how many of the sequence characters to use in the signature to not have any conflicts. Too few and it might conflict with another format or simple text file. Too many and the signature gets complicated and may exclude a short sequence file. As far as I can tell there is no set minimum or maximum for the sequence. Not sure what the genome for Pinus Taeda would look like in FASTA with 22.18 billion base pairs. The other problem is often times these formats are compressed into a GZIP file, so they need to be extracted before identification.

These two formats are just a couple of the many sequencing formats being used in the bioinformatics community. I am sure others will pop up in the future. Until then, I have with the help of others put together a signature which seems to work well for the samples and data sets we have access to. Take a look at my GitHub for the signature proposal. If you find any issues, let me know!

AskSam

I was recently asked to look at a set of files with the extension of .ASK. A quick little search led me to find they belong to AskSam which was a free-form database software often used by researchers and libraries as early as 1985. The first few versions of Access Stored Knowledge via Symbolic Access Method were released for DOS and later Windows. The company askSam Systems disappeared around 2015.

The AskSam software competed with other personal information managers with unstructured data storage and retrieval. It was used to keep track of e-mail, special collections, letters, articles, web sites, etc. It could index all the contents and make searching and retrieval easy. By setting up fields the data could be exported to delimitated text. The software also appears to have been localized in German, but file format is the same.

AskSam had many import filters which included:

  • Microsoft Word
  • WordPerfect
  • Text (ASCII files)
  • HTML Files (from the Internet)
  • RTF Files (Rich Text Format)
  • Eudora E-Mail
  • Microsoft Outlook
  • Microsoft Outlook Express
  • Text delimited files – Comma Separated Values, Fixed position, etc.
  • dBASE
  • FoxPro
  • Paradox
  • Microsoft Access
  • Microsoft Excel

AskSam has its own proprietary format to store the database using the .ASK extension. They appear to have a 256 byte header. All the DOS versions of the software use the simple BOF string of “askSam”.

hexdump -C TEST.ASK       
00000000  61 73 6b 53 61 6d 00 00  00 00 00 07 0f 01 00 00  |askSam..........|
00000010  01 00 00 00 00 01 00 05  00 37 00 02 00 00 00 01  |.........7......|
00000020  33 00 32 00 00 00 00 00  50 00 00 00 00 00 00 00  |3.2.....P.......|
00000030  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
000000d0  00 14 00 01 00 00 01 00  00 00 00 00 00 00 00 00  |................|
000000e0  00 00 00 00 00 01 00 00  00 00 03 1d 42 00 01 00  |............B...|
000000f0  00 13 01 00 00 00 00 01  00 00 00 00 00 00 00 00  |................|
00000100  00 00 00 00 f6 00 00 00  00 54 65 73 74 01 01 01  |.........Test...|
00000110  01 01 00  

When the first Windows version came out in 1993, the header changed to the logical string:

hexdump -C DOS-WIN.ASK | head
00000000  61 73 6b 77 69 6e 00 00  00 00 00 07 0f 01 00 04  |askwin..........|
00000010  01 00 00 00 01 01 00 05  01 37 03 00 00 00 00 01  |.........7......|
00000020  64 00 32 2e 01 4e 00 00  a0 00 00 00 00 00 00 00  |d.2..N..........|
00000030  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000040  00 00 00 00 00 00 00 00  00 00 00 00 00 76 43 00  |.............vC.|
00000050  00 8c 00 00 00 00 00 00  00 00 00 00 00 01 00 00  |................|
00000060  00 01 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000070  00 41 72 69 61 6c 00 72  20 4e 65 77 00 00 00 00  |.Arial.r New....|
00000080  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000090  00 00 00 00 00 00 00 00  00 00 5b 3a 00 10 10 10  |..........[:....|

With Version 2 for Windows we start seeing a slightly different header:

hexdump -C AS2W-S01.ASK 
00000000  61 73 6b 57 69 53 00 00  00 00 00 07 0f 01 00 04  |askWiS..........|
00000010  01 00 00 00 01 01 00 05  00 37 03 00 00 00 00 01  |.........7......|
00000020  c8 00 32 2f 02 4c 00 00  a0 00 00 00 00 00 00 00  |..2/.L..........|
00000030  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00000070  00 43 6f 75 72 69 65 72  20 4e 65 77 00 00 00 00  |.Courier New....|
00000080  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000090  00 00 00 00 00 00 00 00  00 00 5b 3a 00 10 10 14  |..........[:....|
000000a0  14 02 00 00 0a 00 00 00  00 00 00 00 00 00 00 00  |................|
000000b0  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
000000c0  00 00 00 00 00 00 00 00  00 00 00 60 00 00 00 00  |...........`....|
000000d0  05 00 00 00 00 00 01 00  00 00 00 00 00 00 00 00  |................|
000000e0  00 00 00 00 00 01 00 00  00 00 00 00 00 00 01 00  |................|
000000f0  00 1d 01 00 00 00 00 01  00 00 00 00 00 00 0a 00  |................|
00000100  00 00 00 00 f6 00 00 00  0a 54 65 73 74 69 6e 67  |.........Testing|
00000110  20 20 00 0a 01 09 10 c0  14 14 42 07 01           |  ........B..|

Then all samples from version 4 to the final version 7 all have the same header, although I know there is some features in the later versions that make them incompatible, there isn’t a easy way to identify the different versions after version 4.

hexdump -C Asksam4-s01.ask | head
00000000  61 73 6b 77 34 30 00 00  00 00 25 00 00 00 00 00  |askw40....%.....|
00000010  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
000000f0  00 00 00 00 00 00 00 00  00 00 02 00 00 00 e5 38  |...............8|
00000100  0c 3a 67 31 4d 38 dd b5  9c 65 00 00 00 00 90 01  |.:g1M8...e......|
00000110  00 00 01 01 0c 43 00 00  00 00 00 00 be 00 00 00  |.....C..........|
00000120  24 14 00 00 00 00 00 00  10 14 00 00 00 00 00 00  |$...............|
00000130  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00000150  7b 4c 00 00 00 00 00 00  af 4f 00 00 00 00 00 00  |{L.......O......|

hexdump -C AskSam6-s01.ask | head
00000000  61 73 6b 77 34 30 00 00  00 00 38 00 00 00 00 00  |askw40....8.....|
00000010  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
000000f0  00 00 00 00 00 00 00 00  00 00 02 00 00 00 21 f1  |..............!.|
00000100  ad 41 61 9f c0 39 cd 4a  af 65 00 00 00 00 58 02  |.Aa..9.J.e....X.|
00000110  00 00 01 01 84 2e 00 00  00 00 00 00 be 00 00 00  |................|
00000120  24 14 00 00 00 00 00 00  50 13 00 00 00 00 00 00  |$.......P.......|
00000130  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
00000140  00 00 00 00 00 00 00 00  00 00 00 00 c6 5b 00 00  |.............[..|
00000150  ba 33 00 00 00 00 00 00  53 33 00 00 00 00 00 00  |.3......S3......|

hexdump -C AskSam7-s01.ask | head
00000000  61 73 6b 77 34 30 00 00  00 00 87 04 00 00 00 00  |askw40..........|
00000010  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
000000f0  00 00 00 00 00 00 00 00  00 00 02 00 00 00 b2 fd  |................|
00000100  b5 47 61 9f c0 39 5c 4b  af 65 00 00 00 00 bc 02  |.Ga..9\K.e......|
00000110  00 00 01 01 db 34 00 00  00 00 00 00 be 00 00 00  |.....4..........|
00000120  24 14 00 00 00 00 00 00  50 13 00 00 00 00 00 00  |$.......P.......|
00000130  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
00000150  aa 39 00 00 00 00 00 00  de 3c 00 00 00 00 00 00  |.9.......<......|

Even though everything after version 4 for Windows has the same header, files created in version 7 will not open in version 6. There must be some additional byte sequences which identify the files with the version which created the file. I have been unable to located the free askSam 7 viewer, but here is a link to the version 6 free viewer. It runs in the latest Windows OS. If you open an older version it will ask you to upgrade your file, so be sure to keep a copy of your original.

Once you have your ASK Database opened, you can export to a few formats, an RTF or a delimitated text file based on fields you have entered in the form. Word of warning, if you entered a password to protect modifying of your data in an earlier version, you have to re-enter the password in order to open/upgrade the file, but the viewer will not open password protected files, you will need the full version.

Here are two files created in AskSam 5.11 DOS, one without a password one with. You can see the 16 byte hex values from offset 41 to 57 are zeros in the file with no password and full of values in the protected file. I’m sure someone with more skills could figure out the encryption.

hexdump -C AS5-OPEN.ASK 
00000000  61 73 6b 53 61 6d 00 00  00 00 00 07 0f 01 00 00  |askSam..........|
00000010  01 00 00 00 00 01 00 05  00 37 00 02 00 00 00 01  |.........7......|
00000020  33 00 32 00 00 00 00 00  50 00 00 00 00 00 00 00  |3.2.....P.......|
00000030  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
*
000000d0  00 14 00 01 00 00 01 00  00 00 00 00 00 00 00 00  |................|
000000e0  00 00 00 00 00 01 00 00  00 00 03 1d 42 00 01 00  |............B...|
000000f0  00 13 01 00 00 00 00 01  00 00 00 00 00 00 00 00  |................|
00000100  00 00 00 00 f6 00 00 00  00 54 65 73 74 69 6e 67  |.........Testing|
00000110  01 01 00                                          |...|

hexdump -C AS5-PASS.ASK 
00000000  61 73 6b 53 61 6d 00 00  01 00 00 07 0f 01 00 00  |askSam..........|
00000010  01 00 00 00 00 01 00 05  00 37 00 02 00 00 00 01  |.........7......|
00000020  33 00 32 00 00 00 00 00  50 66 5f 14 66 42 53 40  |3.2.....Pf_.fBS@|
00000030  42 71 29 59 6a 61 62 60  6e 00 00 00 00 00 00 00  |Bq)Yjab`n.......|
*
000000d0  00 14 00 01 00 00 01 00  00 00 00 00 00 00 00 00  |................|
000000e0  00 00 00 00 00 01 00 00  00 00 03 1d 42 00 01 00  |............B...|
000000f0  00 13 01 00 00 00 00 01  00 00 00 00 00 00 00 00  |................|
00000100  00 00 00 00 f6 00 00 00  00 54 65 73 74 69 6e 67  |.........Testing|
00000110  01 01 00                                          |...|

You can check out my samples and my recommendation to PRONOM on my Github page.