Velocity Reviews - Computer Hardware Reviews

Velocity Reviews > Newsgroups > Programming > Python > 3.2 can't extract tarfile produced by 2.7

Thread Tools

3.2 can't extract tarfile produced by 2.7

Antoon Pardon
Posts: n/a
I am converting some programs to python 3. These programs manipulate
tarfiles. In order for the python3 programs to be really useful
they need to be able to process the tarfiles produced by python2 that
however seems to be a problem.

This is testcode that produces a tarfile.

#! /usr/bin/python

compression = "bz2"
tarmode = "w|%s" % compression
rt = '.'

import os
import os.path
import errno

import tarfile as tar

def process():
pj = os.path.join
entries = os.listdir(rt)
of = open("DUMP.tbz", "w")
tf = = tarmode, fileobj = of,
encoding = 'ascii', format = tar.PAX_FORMAT)
for entry in entries:
fqpn = pj(rt, entry)
tf.add(fqpn, entry, recursive = False)
except OSError as ErrInfo:
print("%s: disappeared" % fqpn)
if ErrInfo.errno != errno.ENOENT:

if __name__ == "__main__":

================================================== ============================
This is testcode that checks a tarfile


compression = "bz2"
tarmode = "r|%s" % compression

import os
import os.path
import stat

import tarfile as tar

def equalfile(fl1, fl2):
bf1 =
bf2 =
while bf1 == bf2:
if bf1 == "":
return True
bf1 =
bf2 =
return False

def process():
gf = open("DUMP.tbz", "r")
tf = = tarmode, fileobj = gf,
encoding = 'ascii', format = tar.PAX_FORMAT)
for tarinfo in tf:
entry =
fileinfo = os.stat(entry)
if stat.S_ISREG(fileinfo.st_mode) and tarinfo.isreg():
bfl = tf.extractfile(tarinfo)
ofl = open(entry)
if not equalfile(bfl, ofl):
print("%s: does not match backup" % entry)
sync = False

if __name__ == "__main__":

================================================== ===============================

When I use python2.7 to produce and later check the tarfile everything
works as expected. However when I use python3.2 to check the tarfile I
get the following traceback.

Traceback (most recent call last):
File "tarchck", line 39, in <module>
File "tarchck", line 25, in process
encoding = 'ascii', format = tar.PAX_FORMAT)
File "/usr/lib/python3.2/", line 1771, in open
t = cls(name, filemode, stream, **kwargs)
File "/usr/lib/python3.2/", line 1667, in __init__
self.firstmember =
File "/usr/lib/python3.2/", line 2418, in next
tarinfo = self.tarinfo.fromtarfile(self)
File "/usr/lib/python3.2/", line 1281, in fromtarfile
buf =
File "/usr/lib/python3.2/", line 573, in read
buf = self._read(size)
File "/usr/lib/python3.2/", line 585, in _read
buf = self.__read(self.bufsize)
File "/usr/lib/python3.2/", line 604, in __read
buf =
File "/usr/lib/python3.2/", line 300, in decode
(result, consumed) = self._buffer_decode(data, self.errors, final)
UnicodeDecodeError: 'utf-8' codec can't decode byte 0x9e in position 10:
invalid start byte

I have been looking around but have no idea how I have to adapt this
code in order to have it process the tarfile under python3.2. The
original code didn't have the coding and format keywords on the
statement and after reading the documentation I thought that
would make things work, but no such luck. Further reading didn't
provide anything usefull

Antoon Pardon
Reply With Quote

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are Off

Similar Threads
Thread Thread Starter Forum Replies Last Post
Re: 3.2 can't extract tarfile produced by 2.7 Ian Kelly Python 0 12-27-2012 07:27 PM
Re: 3.2 can't extract tarfile produced by 2.7 Ian Kelly Python 0 12-27-2012 07:25 PM
Re: 3.2 can't extract tarfile produced by 2.7 Steven W. Orr Python 0 12-27-2012 06:50 PM
How do i extract vidios when winrar wont extract them??? help plzzzzzzzz Computer Support 2 12-23-2007 07:03 AM
umask option for tarfile extract Faik Uygur Python 0 06-15-2006 11:42 AM