Velocity Reviews > convert a long string in binary

# convert a long string in binary

bussiere maillist
Guest
Posts: n/a

 08-20-2006
i've got a very long string
and i wanted to convert it in binary
like

string = """Monty Python, or The Pythons, is the collective name of
the creators of Monty Python's Flying Circus, a British television
comedy sketch show that first aired on the BBC on October 5, 1969. A
total of 45 episodes were made over four series. However, the Python
phenomenon was much greater, spawning stage tours, a musical, four
films, numerous albums, and several books, as well as launching the
members to individual stardom.

The television series, broadcast by the BBC from 1969 to 1974, was
conceived, written and performed by Graham Chapman, John Cleese
(1969-1973), Terry Gilliam, Eric Idle, Terry Jones and Michael Palin.
Loosely structured as a sketch show, but with a highly innovative
stream-of-consciousness approach (aided by Terry Gilliam's
animations), it pushed the boundaries of what was then considered
acceptable, both in terms of style and content.
"""

string = binary(string)
print string
// 01010101010101

i 'am searching for something like the binary function (taht doesn't
exist) to convert my string directly in binary.

Regards
Bussiere

bearophileHUGS@lycos.com
Guest
Posts: n/a

 08-20-2006
bussiere maillist:
> i've got a very long string
> and i wanted to convert it in binary

Not much tested:

_nibbles = {"0":"0000", "1":"0001", "2":"0010", "3":"0011",
"4":"0100", "5":"0101", "6":"0110", "7":"0111",
"8":"1000", "9":"1001", "A":"1010", "B":"1011",
"C":"1100", "D":"1101", "E":"1110", "F":"1111"}

def toBase2(number):
if number < 16:
return "0000" + _nibbles["%X" % number]
else:
d1, d2 = "%X" % number
return _nibbles[d1] + _nibbles[d2]

convbin = dict((chr(i), toBase2(i)) for i in xrange(256))

def binary(s):
return "".join(convbin[c] for c in s)

print binary("testing string")

Surely there are ways to make it shorter (But it's fast enough).

Bye,
bearophile

mensanator@aol.com
Guest
Posts: n/a

 08-20-2006

bussiere maillist wrote:
> i've got a very long string
> and i wanted to convert it in binary
> like
>
> string = """Monty Python, or The Pythons, is the collective name of
> the creators of Monty Python's Flying Circus, a British television
> comedy sketch show that first aired on the BBC on October 5, 1969. A
> total of 45 episodes were made over four series. However, the Python
> phenomenon was much greater, spawning stage tours, a musical, four
> films, numerous albums, and several books, as well as launching the
> members to individual stardom.
>
> The television series, broadcast by the BBC from 1969 to 1974, was
> conceived, written and performed by Graham Chapman, John Cleese
> (1969-1973), Terry Gilliam, Eric Idle, Terry Jones and Michael Palin.
> Loosely structured as a sketch show, but with a highly innovative
> stream-of-consciousness approach (aided by Terry Gilliam's
> animations), it pushed the boundaries of what was then considered
> acceptable, both in terms of style and content.
> """
>
> string = binary(string)
> print string
> // 01010101010101
>
>
> i 'am searching for something like the binary function (taht doesn't
> exist) to convert my string directly in binary.

The gmpy module can convert numbers to binary strings:

import gmpy

s = """
Yes, well, of course that's just the sort of
blinkered philistine pig ignorance I've come
to expect from you non-creative garbage.
"""

ss = ''
for c in s:
sb = gmpy.digits(ord(c),2) #convert to binary string
ss += sb

print ss

00001010010110010110010101110011001011000010000001 11011101100101011011000110110000101100001000000110 11110110011000100000011000110110111101110101011100 10011100110110010100100000011101000110100001100001 01110100001001110111001100100000011010100111010101 11001101110100001000000111010001101000011001010010 00000111001101101111011100100111010000100000011011 11011001100000101001100010011011000110100101101110 01101011011001010111001001100101011001000010000001 11000001101000011010010110110001101001011100110111 01000110100101101110011001010010000001110000011010 01011001110010000001101001011001110110111001101111 01110010011000010110111001100011011001010010000001 00100100100111011101100110010100100000011000110110 11110110110101100101000010100111010001101111001000 00011001010111100001110000011001010110001101110100 00100000011001100111001001101111011011010010000001 11100101101111011101010010000001101110011011110110 11100010110101100011011100100110010101100001011101 00011010010111011001100101001000000110011101100001 01110010011000100110000101100111011001010010111000 001010

> Regards
> Bussiere

Maric Michaud
Guest
Posts: n/a

 08-22-2006
Le dimanche 20 août 2006 13:55, http://www.velocityreviews.com/forums/(E-Mail Removed) a écrit*:
> bussiere maillist:
> > i've got a very long string
> > and i wanted to convert it in binary

>
> Not much tested:
>
> _nibbles = {"0":"0000", "1":"0001", "2":"0010", "3":"0011",
> "4":"0100", "5":"0101", "6":"0110", "7":"0111",
> "8":"1000", "9":"1001", "A":"1010", "B":"1011",
> "C":"1100", "D":"1101", "E":"1110", "F":"1111"}
>
> def toBase2(number):
> if number < 16:
> return "0000" + _nibbles["%X" % number]
> else:
> d1, d2 = "%X" % number
> return _nibbles[d1] + _nibbles[d2]
>
> convbin = dict((chr(i), toBase2(i)) for i in xrange(256))
>
> def binary(s):
> return "".join(convbin[c] for c in s)
>
> print binary("testing string")
>
> Surely there are ways to make it shorter (But it's fast enough).
>

Maybe this one is more elegant :

In [305]: trans = {}

In [306]: for i in range(256) :
.....: trans[chr(i)] = ''.join(i & 2**j and '1' or '0'
.....: for j in reversed(range())
.....:

In [307]: trans['a']
Out[307]: '01100001'

In [308]: trans['\xee']
Out[308]: '11101110'

In [309]: print ''.join(trans[e] for e in 'test string')
01110100011001010111001101110100001000000111001101 11010001110010011010010110111001100111

> Bye,
> bearophile

--
_____________

Maric Michaud
_____________

Aristote - www.aristote.info
3 place des tapis
69004 Lyon
Tel: +33 426 880 097