read_html: fails to parse column · Issue #3606 · pandas-dev/pandas (original) (raw)

Skip to content

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sign up

Appearance settings

@timmie

Description

@timmie

The second column of the table
http://code.google.com/p/pythonxy/wiki/StandardPlugins#Python_packages

is not parsed as shown with this code:

-- coding: utf-8 --

3.0

import pandas as pd

url = 'http://code.google.com/p/pythonxy/wiki/StandardPlugins'

dfs = pd.read_html(url, attrs={'class': 'wikitable'})

dfs

dfs = pd.read_html(url, flavor='lxml', attrs={'class': 'wikitable'})

dfs

python_core = dfs[0]

python_core[:10]