Velocity Reviews


imx 03-12-2007 07:32 AM

using python to visit web sites and print the web sites image to files
 
Hi there,

I wonder whether Python can be used to simulate a real user doing the
following:
1) open a web site in a browser;
2) press Print Screen to copy the image of the currently active window
to the clipboard;
3) save the image to a file.

Any pointers will be appreciated!

Xiong


daftspaniel@gmail.com 03-12-2007 04:39 PM

Re: using python to visit web sites and print the web sites image to files
 
On Mar 12, 7:32 am, "imx" <xiong.xu...@gmail.com> wrote:
> I wonder whether python can be used to simulate a real user to do the
> following:
> 1) open a web site in a browser;
> 2) printscreen, so to copy the current active window image to
> clipboard;
> 3) save the image file to a real file
> Any pointer will be appreciated!
> Xiong


Google pywinauto.

HTH

Davy



Michael Bentley 03-12-2007 05:08 PM

Re: using python to visit web sites and print the web sites image to files
 

>
> I wonder whether python can be used to simulate a real user to do the
> following:
> 1) open a web site in a browser;
> 2) printscreen, so to copy the current active window image to
> clipboard;
> 3) save the image file to a real file
>
> Any pointer will be appreciated!


Which OS?



Goldfish 03-12-2007 07:40 PM

Re: using python to visit web sites and print the web sites image to files
 
You can definitely create a web bot with Python. It doesn't require
that you "drive" a real web browser. There are libraries to open web
pages, scrape their contents, and download files. That would make your
bot platform neutral. Driving a GUI browser risks producing a brittle
script that might not handle different browsers, different platforms,
or even different versions of the same browser.

I run a MediaWiki web site, and found a handy Python-based library
written to manage it, called pywikipediabot, at
http://sourceforge.net/projects/pywikipediabot/.

Okay, this library won't do your legwork for you, but it has pieces
that demonstrate how to use Python to surf a web site. Then, with an
HTML parser, you can hunt down images.
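
As a rough illustration of the "HTML parser" step, here is a minimal
sketch that uses only the standard library (html.parser and urllib),
not pywikipediabot. The names ImageCollector, find_images and
images_on_page are made up for this example, and it assumes the page
decodes as UTF-8.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import urlopen


class ImageCollector(HTMLParser):
    """Collect the absolute URL of every <img src="..."> tag that is fed in."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.images = []

    def handle_starttag(self, tag, attrs):
        # Self-closing tags such as <img ... /> also end up here by default.
        if tag != "img":
            return
        src = dict(attrs).get("src")
        if src:
            # Resolve relative paths against the URL of the page.
            self.images.append(urljoin(self.base_url, src))


def find_images(html, base_url):
    """Return the image URLs found in an HTML string, in document order."""
    parser = ImageCollector(base_url)
    parser.feed(html)
    parser.close()
    return parser.images


def images_on_page(url):
    """Download a page and list its images (assumes UTF-8 content)."""
    with urlopen(url) as response:
        html = response.read().decode("utf-8", errors="replace")
    return find_images(html, url)
```

Calling images_on_page() with a page address returns a list of image
URLs that could then be downloaded one by one. Note that this finds the
images embedded in a page; it does not render the page itself.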

Greg


Alberto Monteiro 03-12-2007 07:49 PM

Re: using python to visit web sites and print the web sites image to files
 
Goldfish wrote:
> I run a mediawiki web site, and found a handy python-based library
> written to manage it called pywikipediabot at
> http://sourceforge.net/projects/pywikipediabot/.

This sounds interesting. My daughter had a nightmare that a hacker
invaded her Orkut and blanked all 1500+ scraps. This is not impossible.
Maybe I should save the contents to a file...

Alberto Monteiro


Paul Boddie 03-12-2007 08:26 PM

Re: using python to visit web sites and print the web sites image to files
 
Goldfish wrote:
> You can definitely create a web bot with python. It doesn't require
> that you "drive" a real web browser.


That's true, but if you want to print the page to a file, you need
something that can reproduce the intended layout. The Pyglet library
developers mention "XML/HTML+CSS" as something the layout engine can
deal with, which sounds quite impressive if its support of CSS is
comprehensive:

http://pyglet.org/

Paul


imx 03-13-2007 12:54 AM

Re: using python to visit web sites and print the web sites image to files
 
On Mar 13, 4:26 am, "Paul Boddie" <p...@boddie..org.uk> wrote:
> Goldfish wrote:
> > You can definitely create a web bot with python. It doesn't require
> > that you "drive" a real web browser.

>
> That's true, but if you want to print the page to a file, you need
> something that can reproduce the intended layout. The Pyglet library
> developers mention "XML/HTML+CSS" as something the layout engine can
> deal with, which sounds quite impressive if its support of CSS is
> comprehensive:
>
> http://pyglet.org/
>
> Paul


Thanks for all the replies.
I will check pyglet to see if it can help.

The reason I want to simulate a user rather than just crawl is this: we
have to check the front pages of many web sites to see whether they
conform to our visual standard, e.g., that each one puts a search box
near the top of the page. This is tedious work for a human. So I want to
crawl and save the visual presentation of each web site automatically,
and check the image files later by eye.
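
One possible shape for this, as a sketch only: open each address in the
default browser with the standard webbrowser module, wait, and grab the
screen with ImageGrab from the third-party Pillow (PIL) package. The
function names, the fixed wait and the file naming are assumptions made
for this example. ImageGrab.grab() captures the whole screen rather than
only the active window, and a fixed wait does not guarantee that the page
has finished loading.

```python
import re
import time
import webbrowser
from urllib.parse import urlparse


def screenshot_name(url):
    """Turn a URL into a safe PNG file name for later review by eye."""
    parts = urlparse(url)
    raw = parts.netloc + parts.path
    # Replace anything that is awkward in a file name with an underscore.
    safe = re.sub(r"[^A-Za-z0-9.-]+", "_", raw).strip("_")
    return (safe or "page") + ".png"


def capture_pages(urls, wait_seconds=10):
    """Open each URL in the default browser and save a full-screen capture."""
    # Imported here so that screenshot_name() works without Pillow installed.
    from PIL import ImageGrab

    saved = []
    for url in urls:
        webbrowser.open(url)
        time.sleep(wait_seconds)  # crude: give the page time to load
        image = ImageGrab.grab()  # whole screen, not just the browser window
        name = screenshot_name(url)
        image.save(name)
        saved.append(name)
    return saved
```

Something like capture_pages(["http://www.example.com/"]) would then
leave one PNG per site in the current directory. It needs a desktop
session with a browser in the foreground, so it is as brittle as any
other GUI-driven script.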

-Xiong


imx 03-13-2007 03:34 AM

Re: using python to visit web sites and print the web sites image to files
 
On Mar 13, 12:39 am, "daftspan...@gmail.com" <daftspan...@gmail.com>
wrote:
> On Mar 12, 7:32 am, "imx" <xiong.xu...@gmail.com> wrote:
>
> > I wonder whether python can be used to simulate a real user to do the
> > following:
> > 1) open a web site in a browser;
> > 2) printscreen, so to copy the current active window image to
> > clipboard;
> > 3) save the image file to a real file
> > Any pointer will be appreciated!
> > Xiong

>
> Google pywinauto.
>
> HTH
>
> Davy


I checked pyglet; it's at an early stage of development. Since I'm using
Windows, I will try pywinauto.

Thanks,
Xiong


daftspaniel@gmail.com 03-13-2007 09:44 PM

Re: using python to visit web sites and print the web sites image to files
 
> The reason I want to do simulation but not just crawling is : we have
> to check many web pages' front page to see whether it conform to our
> visual standard, e.g, it should put a search box on the top part of
> the page. It's tedious for human work. So I want to 'crawl and save
> the visual presentation of the web site automatically', and check
> these image files later with human eyes.
>
> -Xiong


Hi Xiong,

I have been working on a program to do something very similar to
generate thumbnails of websites.

The code is in IronPython (which may put you off!) and would need to be
modified, or scripted with pywinauto, to deal with multiple images.

Let me know if it is of use to you and I will upload it.

Cheers,
Davy


imx 03-14-2007 09:02 AM

Re: using python to visit web sites and print the web sites image to files
 
On Mar 14, 5:44 am, "daftspan...@gmail.com" <daftspan...@gmail.com>
wrote:
> > The reason I want to do simulation but not just crawling is : we have
> > to check many web pages' front page to see whether it conform to our
> > visual standard, e.g, it should put a search box on the top part of
> > the page. It's tedious for human work. So I want to 'crawl and save
> > the visual presentation of the web site automatically', and check
> > these image files later with human eyes.

>
> > -Xiong

>
> Hi Xiong,
>
> I have been working on a program to do something very similar to
> generate thumbnails of websites.
>
> The code is in IronPython (which may put you off!) and would need
> modified or scripted with pywinauto to deal with multiple images.
>
> Let me know if it is of use to you and I will upload it.
>
> Cheers,
> Davy


Cool, but does that mean I will need .NET to run the code?

Xiong


