python树形 解析html,HTML Parse tree using Python 2.7

郝冥夜
2023-12-01

I was trying to get configure one parse tree for the below HTML table,but couldn't form it.I want to see how the tree structure looks like!can anyone help me here?

#

#

#

# The Dormouse's story

#

#

#

#

#

# The Dormouse's story

#

#

#

# Once upon a time there were three little sisters; and their names were

#

# Elsie

#

# ,

#

# Lacie

#

# and

#

# Tillie

#

# ; and they lived at the bottom of a well.

#

#

# ...

#

#

#

EDIT

Microsoft Windows [Version 6.1.7600]

Copyright (c) 2009 Microsoft Corporation. All rights reserved.

C:\Users\matt>easy_install ete2

Searching for ete2

Reading http://pypi.python.org/simple/ete2/

Reading http://ete.cgenomics.org

Reading http://ete.cgenomics.org/releases/ete2/

Reading http://ete.cgenomics.org/releases/ete2

Best match: ete2 2.1rev539

Downloading http://ete.cgenomics.org/releases/ete2/ete2-2.1rev539.tar.gz

Processing ete2-2.1rev539.tar.gz

Running ete2-2.1rev539\setup.py -q bdist_egg --dist-dir c:\users\arupra~1\appdat

a\local\temp\easy_install-sypg3x\ete2-2.1rev539\egg-dist-tmp-zemohm

Installing ETE (A python Environment for Tree Exploration).

Checking dependencies...

numpy cannot be found in your python installation.

Numpy is required for the ArrayTable and ClusterTree classes.

MySQLdb cannot be found in your python installation.

MySQLdb is required for the PhylomeDB access API.

PyQt4 cannot be found in your python installation.

PyQt4 is required for tree visualization and image rendering.

lxml cannot be found in your python installation.

lxml is required from Nexml and Phyloxml support.

However, you can still install ETE without such functionality.

Do you want to continue with the installation anyway? [y,n]y

Your installation ID is: d33ba3b425728e95c47cdd98acda202f

warning: no files found matching '*' under directory '.'

warning: no files found matching '*.*' under directory '.'

warning: manifest_maker: MANIFEST.in, line 4: path 'doc/ete_guide/' cannot end w

ith '/'

warning: manifest_maker: MANIFEST.in, line 5: path 'doc/' cannot end with '/'

warning: no previously-included files matching '*.pyc' found under directory '.'

zip_safe flag not set; analyzing archive contents...

Adding ete2 2.1rev539 to easy-install.pth file

Installing ete2 script to C:\Python27\Scripts

Installed c:\python27\lib\site-packages\ete2-2.1rev539-py2.7.egg

Processing dependencies for ete2

Finished processing dependencies for ete2

 类似资料: