Audio/video stream recording forums (http://stream-recorder.com/forum/index.php)
-   Removing DRM protection from eBooks (http://stream-recorder.com/forum/forumdisplay.php?f=63)
-   -  

How to convert Topaz ebooks to HTML (Remove DRM from TPZ and AZW1 books for Kindle)

(http://stream-recorder.com/forum/showthread.php?t=5426)

any ANONYMOUS forum user 04-12-2010 03:12 AM

Re: How to convert Topaz ebooks to HTML (Remove DRM from TPZ and AZW1 books for Kindl


 
Quote:

Originally Posted by jcklaus (Post 18075)
Running this I keep getting the error "Can not find dict0000.dat file" What am I doing wrong?

Running what? On what OS? On what files?

jcklaus 04-12-2010 03:35 PM

Re: How to convert Topaz ebooks to HTML (Remove DRM from TPZ and AZW1 books for Kindl


 
Running Mac OS X Terminal and using the command line:

python TopazFiles2HTML.pyw MYTOPAZBOOKNAME

Stream Recorder 04-12-2010 11:43 PM

Re: How to convert Topaz ebooks to HTML (Remove DRM from TPZ and AZW1 books for Kindl


 
Quote:

Originally Posted by jcklaus (Post 18113)
Running Mac OS X Terminal and using the command line:

python TopazFiles2HTML.pyw MYTOPAZBOOKNAME

The files in the lib directory are used by the script. Make sure to extract the lib directory with the other scripts.

From my understanding, you need to run
1. TopazExtract_Kindle_iPhone.pyw
2. then run TopazFiles2XML.pyw,
3. and then run either TopazFiles2SVG.pyw or TopazFiles2HTML.pyw
May be I'm wrong. I don't have a Kindle to check it out.

You can also try to run cmbdtc.py on your Topaz ebook
Code:

cmbtc_dump.py -d -o TARGETDIR [-p pid] YOURTOPAZBOOKNAMEHERE
and see whether you get any errors.

Maradona10 04-30-2010 11:14 AM

Re: How to convert Topaz ebooks to HTML (Remove DRM from TPZ and AZW1 books for Kindl


 
Quote:

Originally Posted by Stream Recorder (Post 17767)
The following set of tools can also be used to remove DRM from Amazon Topaz eBooks:
  • TopazExtract_Kindle_iPhone.pyw,
  • TopazFiles2XML.pyw,
  • TopazFiles2SVG.pyw,
  • TopazFiles2HTML.pyw

tools_v1.6b.zip.
Code:

http://www.mediafire.com/?mn3vmttbwrt
The scripts should work with Kindle and iPhone Amazon Topaz Files (.tpz, .azw1). The files are really images of pages with OCR performed on them. Using the tools you can get SVG images of the pages, and the OCRed HTML version for clean-up.

I have downloaded these scripts. But can you please explain how to execute them properly and in which order? Cause I'm new to python. I have a topaz book on my iphone and want to convert it, but don't know how.

Any help appreciated.

jcklaus 05-01-2010 06:03 PM

Re: How to convert Topaz ebooks to HTML (Remove DRM from TPZ and AZW1 books for Kindl


 
Quote:

Originally Posted by Stream Recorder (Post 18120)
The files in the lib directory are used by the script. Make sure to extract the lib directory with the other scripts.

From my understanding, you need to run
1. TopazExtract_Kindle_iPhone.pyw
2. then run TopazFiles2XML.pyw,
3. and then run either TopazFiles2SVG.pyw or TopazFiles2HTML.pyw
May be I'm wrong. I don't have a Kindle to check it out.

You can also try to run cmbdtc.py on your Topaz ebook
Code:

cmbtc_dump.py -d -o TARGETDIR [-p pid] YOURTOPAZBOOKNAMEHERE
and see whether you get any errors.

Whenever I try the first step of running TopazExtract_Kindle_Iphone.pyw I get the following error message:

File "./lib/cmbtc_dump_nonK4PC.py", line 517, in <module>
sys.exit(main())
File "./lib/cmbtc_dump_nonK4PC.py", line 478, in main
bookFile = openBook(args[0])
File "./lib/cmbtc_dump_nonK4PC.py", line 57, in openBook
raise CMBDTCFatal("Could not open book file: " + path)
__main__.CMBDTCFatal: Could not open book file:

Stream Recorder 05-02-2010 01:31 AM

Re: How to convert Topaz ebooks to HTML (Remove DRM from TPZ and AZW1 books for Kindl


 
Quote:

Originally Posted by jcklaus (Post 18666)
Whenever I try the first step of running TopazExtract_Kindle_Iphone.pyw I get the following error message:

File "./lib/cmbtc_dump_nonK4PC.py", line 517, in <module>
sys.exit(main())
File "./lib/cmbtc_dump_nonK4PC.py", line 478, in main
bookFile = openBook(args[0])
File "./lib/cmbtc_dump_nonK4PC.py", line 57, in openBook
raise CMBDTCFatal("Could not open book file: " + path)
__main__.CMBDTCFatal: Could not open book file:

Are you trying to remove DRM from Topaz book? Or mobipocket book?

jcklaus 05-02-2010 07:20 AM

Re: How to convert Topaz ebooks to HTML (Remove DRM from TPZ and AZW1 books for Kindl


 
I'm trying to remove DRM from my Topaz azw1 files to eventually convert to epub

yankgirl013 06-01-2010 07:13 AM

Re: How to convert Topaz ebooks to HTML (Remove DRM from TPZ and AZW1 books for Kindl


 
Hi, I'm hoping someone can help me here. I'm trying to remove a DRM off a topaz file and I'm not getting anywhere. I'm using a Mac OS.
I've downloaded all the files listed, but I keep getting a 'Can not find dict0000.dat file' error.

Is there a way we can 'dumb' the directions down? I've removed them from azw and mobi using terminal and python scripts.

Thanks so much!!!

It really doesn't matter what I convert it to, I can just change it to epub using calibre

any ANONYMOUS forum user 06-01-2010 11:30 AM

Re: How to convert Topaz ebooks to HTML (Remove DRM from TPZ and AZW1 books for Kindl


 
Quote:

Originally Posted by yankgirl013 (Post 19314)
I keep getting a 'Can not find dict0000.dat file' error.

Not sure whether it will help

Quote:

Originally Posted by some updates
That means it can’t find your dict0000.dat file which should be right where you are.

This can not be the true error since other pages worked.

Please make sure that all of these are in the some location (i.e. side by side inside of TARGETDIR

convert2xml.py
dict0000.dat
pageNNNN.dat

where NNNN is the number of the problem page*.dat file

Then make sure you have cd to the TARGETDIR and then run

convert2xml.py -d dict0000.dat pageNNNN.dat > debug.txt

where again the NNNN is the number of the page file that does not work.

Then look in debug.txt for “Unknown” or any other warning or error message and post at http://darkreverser.wordpress.com/2008/02/13/new-blog/ what it says around that point in the debug.txt file.


djpyle 06-09-2010 06:24 PM

Re: How to convert Topaz ebooks to HTML (Remove DRM from TPZ and AZW1 books for Kindl


 
Quote:

Originally Posted by jcklaus (Post 18666)
Whenever I try the first step of running TopazExtract_Kindle_Iphone.pyw I get the following error message:

File "./lib/cmbtc_dump_nonK4PC.py", line 517, in <module>
sys.exit(main())
File "./lib/cmbtc_dump_nonK4PC.py", line 478, in main
bookFile = openBook(args[0])
File "./lib/cmbtc_dump_nonK4PC.py", line 57, in openBook
raise CMBDTCFatal("Could not open book file: " + path)
__main__.CMBDTCFatal: Could not open book file:

I'm getting this same error trying to remove DRM from a .tpz file. Any ideas? I'm running OSX 10.6.3.

If I try to run it without the PID, I get:

Traceback (most recent call last):
File "./lib/cmbtc_dump.py", line 37, in <module>
from ctypes import windll, c_char_p, c_wchar_p, c_uint, POINTER, byref, \
ImportError: cannot import name windll

Error: File Extraction Failed


All times are GMT -6. The time now is 10:03 PM.