pd.read_html() imports a list rather than a dataframe
I used pd.read_html()
to import a table from a webpage but instead of structuring the data as a dataframe Python imported it as a list. How can I import the data as a dataframe? Thank you!
The code is the following:
import pandas as pd
import html5lib
url = 'http://www.fdic.gov/bank/individual/failed/banklist.html'
dfs = pd.read_html(url)
type(dfs)
Out[1]: list
python html pandas
add a comment |
I used pd.read_html()
to import a table from a webpage but instead of structuring the data as a dataframe Python imported it as a list. How can I import the data as a dataframe? Thank you!
The code is the following:
import pandas as pd
import html5lib
url = 'http://www.fdic.gov/bank/individual/failed/banklist.html'
dfs = pd.read_html(url)
type(dfs)
Out[1]: list
python html pandas
add a comment |
I used pd.read_html()
to import a table from a webpage but instead of structuring the data as a dataframe Python imported it as a list. How can I import the data as a dataframe? Thank you!
The code is the following:
import pandas as pd
import html5lib
url = 'http://www.fdic.gov/bank/individual/failed/banklist.html'
dfs = pd.read_html(url)
type(dfs)
Out[1]: list
python html pandas
I used pd.read_html()
to import a table from a webpage but instead of structuring the data as a dataframe Python imported it as a list. How can I import the data as a dataframe? Thank you!
The code is the following:
import pandas as pd
import html5lib
url = 'http://www.fdic.gov/bank/individual/failed/banklist.html'
dfs = pd.read_html(url)
type(dfs)
Out[1]: list
python html pandas
python html pandas
edited Sep 26 '16 at 19:34
alecxe
327k71643867
327k71643867
asked Sep 26 '16 at 19:31
AlKAlK
76415
76415
add a comment |
add a comment |
1 Answer
1
active
oldest
votes
.read_html()
produces a list of dataframes (there could be multiple tables in an HTML source), get the desired one by index. In your case, there is a single dataframe:
dfs = pd.read_html(url)
df = dfs[0]
print(df)
Note that, if there are no table
s in the HTML source, it would return an error and would never produce an empty list.
Thank you! Indeed df is a dataframe. However when I tried the .head, .tail and .index parameters on df Python returned error messages. How can I correct this?
– AlK
Sep 26 '16 at 19:45
@AlexanderKonstantinidis interesting,df.tail
,df.head
,df.index
work for me. What error(s) are you getting?
– alecxe
Sep 26 '16 at 19:48
AttributeError: 'DataFrame' object has no attribute 'heads', AttributeError: 'DataFrame' object has no attribute 'tails', TypeError: 'Index' object is not callable
– AlK
Sep 26 '16 at 19:51
1
@AlexanderKonstantinidis ah, it istail
,head
andindex
- nos
prefix and no()
(these are not methods).
– alecxe
Sep 26 '16 at 19:56
add a comment |
Your Answer
StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f39710903%2fpd-read-html-imports-a-list-rather-than-a-dataframe%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
.read_html()
produces a list of dataframes (there could be multiple tables in an HTML source), get the desired one by index. In your case, there is a single dataframe:
dfs = pd.read_html(url)
df = dfs[0]
print(df)
Note that, if there are no table
s in the HTML source, it would return an error and would never produce an empty list.
Thank you! Indeed df is a dataframe. However when I tried the .head, .tail and .index parameters on df Python returned error messages. How can I correct this?
– AlK
Sep 26 '16 at 19:45
@AlexanderKonstantinidis interesting,df.tail
,df.head
,df.index
work for me. What error(s) are you getting?
– alecxe
Sep 26 '16 at 19:48
AttributeError: 'DataFrame' object has no attribute 'heads', AttributeError: 'DataFrame' object has no attribute 'tails', TypeError: 'Index' object is not callable
– AlK
Sep 26 '16 at 19:51
1
@AlexanderKonstantinidis ah, it istail
,head
andindex
- nos
prefix and no()
(these are not methods).
– alecxe
Sep 26 '16 at 19:56
add a comment |
.read_html()
produces a list of dataframes (there could be multiple tables in an HTML source), get the desired one by index. In your case, there is a single dataframe:
dfs = pd.read_html(url)
df = dfs[0]
print(df)
Note that, if there are no table
s in the HTML source, it would return an error and would never produce an empty list.
Thank you! Indeed df is a dataframe. However when I tried the .head, .tail and .index parameters on df Python returned error messages. How can I correct this?
– AlK
Sep 26 '16 at 19:45
@AlexanderKonstantinidis interesting,df.tail
,df.head
,df.index
work for me. What error(s) are you getting?
– alecxe
Sep 26 '16 at 19:48
AttributeError: 'DataFrame' object has no attribute 'heads', AttributeError: 'DataFrame' object has no attribute 'tails', TypeError: 'Index' object is not callable
– AlK
Sep 26 '16 at 19:51
1
@AlexanderKonstantinidis ah, it istail
,head
andindex
- nos
prefix and no()
(these are not methods).
– alecxe
Sep 26 '16 at 19:56
add a comment |
.read_html()
produces a list of dataframes (there could be multiple tables in an HTML source), get the desired one by index. In your case, there is a single dataframe:
dfs = pd.read_html(url)
df = dfs[0]
print(df)
Note that, if there are no table
s in the HTML source, it would return an error and would never produce an empty list.
.read_html()
produces a list of dataframes (there could be multiple tables in an HTML source), get the desired one by index. In your case, there is a single dataframe:
dfs = pd.read_html(url)
df = dfs[0]
print(df)
Note that, if there are no table
s in the HTML source, it would return an error and would never produce an empty list.
answered Sep 26 '16 at 19:36
alecxealecxe
327k71643867
327k71643867
Thank you! Indeed df is a dataframe. However when I tried the .head, .tail and .index parameters on df Python returned error messages. How can I correct this?
– AlK
Sep 26 '16 at 19:45
@AlexanderKonstantinidis interesting,df.tail
,df.head
,df.index
work for me. What error(s) are you getting?
– alecxe
Sep 26 '16 at 19:48
AttributeError: 'DataFrame' object has no attribute 'heads', AttributeError: 'DataFrame' object has no attribute 'tails', TypeError: 'Index' object is not callable
– AlK
Sep 26 '16 at 19:51
1
@AlexanderKonstantinidis ah, it istail
,head
andindex
- nos
prefix and no()
(these are not methods).
– alecxe
Sep 26 '16 at 19:56
add a comment |
Thank you! Indeed df is a dataframe. However when I tried the .head, .tail and .index parameters on df Python returned error messages. How can I correct this?
– AlK
Sep 26 '16 at 19:45
@AlexanderKonstantinidis interesting,df.tail
,df.head
,df.index
work for me. What error(s) are you getting?
– alecxe
Sep 26 '16 at 19:48
AttributeError: 'DataFrame' object has no attribute 'heads', AttributeError: 'DataFrame' object has no attribute 'tails', TypeError: 'Index' object is not callable
– AlK
Sep 26 '16 at 19:51
1
@AlexanderKonstantinidis ah, it istail
,head
andindex
- nos
prefix and no()
(these are not methods).
– alecxe
Sep 26 '16 at 19:56
Thank you! Indeed df is a dataframe. However when I tried the .head, .tail and .index parameters on df Python returned error messages. How can I correct this?
– AlK
Sep 26 '16 at 19:45
Thank you! Indeed df is a dataframe. However when I tried the .head, .tail and .index parameters on df Python returned error messages. How can I correct this?
– AlK
Sep 26 '16 at 19:45
@AlexanderKonstantinidis interesting,
df.tail
, df.head
, df.index
work for me. What error(s) are you getting?– alecxe
Sep 26 '16 at 19:48
@AlexanderKonstantinidis interesting,
df.tail
, df.head
, df.index
work for me. What error(s) are you getting?– alecxe
Sep 26 '16 at 19:48
AttributeError: 'DataFrame' object has no attribute 'heads', AttributeError: 'DataFrame' object has no attribute 'tails', TypeError: 'Index' object is not callable
– AlK
Sep 26 '16 at 19:51
AttributeError: 'DataFrame' object has no attribute 'heads', AttributeError: 'DataFrame' object has no attribute 'tails', TypeError: 'Index' object is not callable
– AlK
Sep 26 '16 at 19:51
1
1
@AlexanderKonstantinidis ah, it is
tail
, head
and index
- no s
prefix and no ()
(these are not methods).– alecxe
Sep 26 '16 at 19:56
@AlexanderKonstantinidis ah, it is
tail
, head
and index
- no s
prefix and no ()
(these are not methods).– alecxe
Sep 26 '16 at 19:56
add a comment |
Thanks for contributing an answer to Stack Overflow!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f39710903%2fpd-read-html-imports-a-list-rather-than-a-dataframe%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown