Comparing two dataframe and replacing the column values
I have two dataframe
I need to compare both dataframe and my output should in such a way that if the values a df1 is present in df leave it as it is else it should be replaced by Out. For example values of column in Level_count should be like L1,L1,L1,L2,L2,L2,L2,Out,Out,Out (as L3 and l4 are not in df1) like this same way i need to compare Edu and Occ as well.
This is my desired output
Could anyone help me in solving out this solution.
Thanks in Advance.
python python-3.x
add a comment |
I have two dataframe
I need to compare both dataframe and my output should in such a way that if the values a df1 is present in df leave it as it is else it should be replaced by Out. For example values of column in Level_count should be like L1,L1,L1,L2,L2,L2,L2,Out,Out,Out (as L3 and l4 are not in df1) like this same way i need to compare Edu and Occ as well.
This is my desired output
Could anyone help me in solving out this solution.
Thanks in Advance.
python python-3.x
1
Please provide some input data as text, show us your desired output and your latest attempts. See Minimal, Complete, and Verifiable example.
– jpp
Nov 16 '18 at 10:52
add a comment |
I have two dataframe
I need to compare both dataframe and my output should in such a way that if the values a df1 is present in df leave it as it is else it should be replaced by Out. For example values of column in Level_count should be like L1,L1,L1,L2,L2,L2,L2,Out,Out,Out (as L3 and l4 are not in df1) like this same way i need to compare Edu and Occ as well.
This is my desired output
Could anyone help me in solving out this solution.
Thanks in Advance.
python python-3.x
I have two dataframe
I need to compare both dataframe and my output should in such a way that if the values a df1 is present in df leave it as it is else it should be replaced by Out. For example values of column in Level_count should be like L1,L1,L1,L2,L2,L2,L2,Out,Out,Out (as L3 and l4 are not in df1) like this same way i need to compare Edu and Occ as well.
This is my desired output
Could anyone help me in solving out this solution.
Thanks in Advance.
python python-3.x
python python-3.x
edited Nov 16 '18 at 11:09
Yadhu
asked Nov 16 '18 at 10:47
YadhuYadhu
658
658
1
Please provide some input data as text, show us your desired output and your latest attempts. See Minimal, Complete, and Verifiable example.
– jpp
Nov 16 '18 at 10:52
add a comment |
1
Please provide some input data as text, show us your desired output and your latest attempts. See Minimal, Complete, and Verifiable example.
– jpp
Nov 16 '18 at 10:52
1
1
Please provide some input data as text, show us your desired output and your latest attempts. See Minimal, Complete, and Verifiable example.
– jpp
Nov 16 '18 at 10:52
Please provide some input data as text, show us your desired output and your latest attempts. See Minimal, Complete, and Verifiable example.
– jpp
Nov 16 '18 at 10:52
add a comment |
1 Answer
1
active
oldest
votes
You need:
df2_dict=df2.to_dict(orient='list')
# {'Level_Count': ['L1', 'L2'], 'Edu': ['MBBS', None], 'Occ': ['MBBS1', None]}
for c in df1.columns:
df1[c]=df1[c].apply(lambda x: x if x in df2_dict[c] else 'out')
Output:
Level_Count Edu Occ
0 L1 MBBS MBBS1
1 L1 MBBS MBBS1
2 L1 out out
3 L2 MBBS MBBS1
4 L2 MBBS MBBS1
5 L2 MBBS MBBS1
6 L2 MBBS MBBS1
7 out MBBS MBBS1
8 out out out
9 out MBBS MBBS1
1
Thank you @Sociopath This worked well and good. Thanks a lot
– Yadhu
Nov 16 '18 at 11:39
add a comment |
StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53336296%2fcomparing-two-dataframe-and-replacing-the-column-values%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
You need:
df2_dict=df2.to_dict(orient='list')
# {'Level_Count': ['L1', 'L2'], 'Edu': ['MBBS', None], 'Occ': ['MBBS1', None]}
for c in df1.columns:
df1[c]=df1[c].apply(lambda x: x if x in df2_dict[c] else 'out')
Output:
Level_Count Edu Occ
0 L1 MBBS MBBS1
1 L1 MBBS MBBS1
2 L1 out out
3 L2 MBBS MBBS1
4 L2 MBBS MBBS1
5 L2 MBBS MBBS1
6 L2 MBBS MBBS1
7 out MBBS MBBS1
8 out out out
9 out MBBS MBBS1
1
Thank you @Sociopath This worked well and good. Thanks a lot
– Yadhu
Nov 16 '18 at 11:39
add a comment |
You need:
df2_dict=df2.to_dict(orient='list')
# {'Level_Count': ['L1', 'L2'], 'Edu': ['MBBS', None], 'Occ': ['MBBS1', None]}
for c in df1.columns:
df1[c]=df1[c].apply(lambda x: x if x in df2_dict[c] else 'out')
Output:
Level_Count Edu Occ
0 L1 MBBS MBBS1
1 L1 MBBS MBBS1
2 L1 out out
3 L2 MBBS MBBS1
4 L2 MBBS MBBS1
5 L2 MBBS MBBS1
6 L2 MBBS MBBS1
7 out MBBS MBBS1
8 out out out
9 out MBBS MBBS1
1
Thank you @Sociopath This worked well and good. Thanks a lot
– Yadhu
Nov 16 '18 at 11:39
add a comment |
You need:
df2_dict=df2.to_dict(orient='list')
# {'Level_Count': ['L1', 'L2'], 'Edu': ['MBBS', None], 'Occ': ['MBBS1', None]}
for c in df1.columns:
df1[c]=df1[c].apply(lambda x: x if x in df2_dict[c] else 'out')
Output:
Level_Count Edu Occ
0 L1 MBBS MBBS1
1 L1 MBBS MBBS1
2 L1 out out
3 L2 MBBS MBBS1
4 L2 MBBS MBBS1
5 L2 MBBS MBBS1
6 L2 MBBS MBBS1
7 out MBBS MBBS1
8 out out out
9 out MBBS MBBS1
You need:
df2_dict=df2.to_dict(orient='list')
# {'Level_Count': ['L1', 'L2'], 'Edu': ['MBBS', None], 'Occ': ['MBBS1', None]}
for c in df1.columns:
df1[c]=df1[c].apply(lambda x: x if x in df2_dict[c] else 'out')
Output:
Level_Count Edu Occ
0 L1 MBBS MBBS1
1 L1 MBBS MBBS1
2 L1 out out
3 L2 MBBS MBBS1
4 L2 MBBS MBBS1
5 L2 MBBS MBBS1
6 L2 MBBS MBBS1
7 out MBBS MBBS1
8 out out out
9 out MBBS MBBS1
answered Nov 16 '18 at 11:22
AkshayNevrekarAkshayNevrekar
5,78492040
5,78492040
1
Thank you @Sociopath This worked well and good. Thanks a lot
– Yadhu
Nov 16 '18 at 11:39
add a comment |
1
Thank you @Sociopath This worked well and good. Thanks a lot
– Yadhu
Nov 16 '18 at 11:39
1
1
Thank you @Sociopath This worked well and good. Thanks a lot
– Yadhu
Nov 16 '18 at 11:39
Thank you @Sociopath This worked well and good. Thanks a lot
– Yadhu
Nov 16 '18 at 11:39
add a comment |
Thanks for contributing an answer to Stack Overflow!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53336296%2fcomparing-two-dataframe-and-replacing-the-column-values%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
1
Please provide some input data as text, show us your desired output and your latest attempts. See Minimal, Complete, and Verifiable example.
– jpp
Nov 16 '18 at 10:52