Can we avoid using one-hot encoding on categorical data that has only 2 unique values?




















I am using an MLP classifier. There are around 4-5 categorical attributes, such as Gender (Male/Female), Smoking (Yes/No), Diabetes (Yes/No), and Hypertension (Yes/No).

Do I need to one-hot encode all of these features before using a neural network classifier? I don't have much training data (only 130 samples).

Can I get away with just label encoding these attributes?



































  • Read the accepted answer here

    – AkshayNevrekar
    Nov 16 '18 at 14:01


















python machine-learning scikit-learn














edited Nov 17 '18 at 0:11 by desertnaut

asked Nov 16 '18 at 13:58 by Gautam Phadke

























1 Answer














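Yes, label encoding is enough here. For a feature with only two values, a single 0/1 column carries the same information as the two columns one-hot encoding would produce, so there is no gain in using one-hot in this case.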
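To illustrate why, here is a minimal sketch with scikit-learn. The toy rows are made up for illustration and are not the asker's data. It uses `OrdinalEncoder` for the 0/1 encoding, since scikit-learn's `LabelEncoder` is intended for target labels rather than feature columns, and it checks that each pair of one-hot columns is fully determined by the single label-encoded column.

```python
import numpy as np
from sklearn.preprocessing import OneHotEncoder, OrdinalEncoder

# Hypothetical toy rows: Gender, Smoking, Diabetes, Hypertension.
X = np.array([
    ["Male", "Yes", "No", "No"],
    ["Female", "No", "No", "Yes"],
    ["Female", "Yes", "Yes", "No"],
    ["Male", "No", "Yes", "Yes"],
])

# Label (ordinal) encoding: one 0/1 column per binary feature.
label_encoded = OrdinalEncoder().fit_transform(X)

# Full one-hot encoding: two columns per binary feature.
one_hot = OneHotEncoder().fit_transform(X).toarray()

# For each feature, the two one-hot columns are [1 - x, x], where x is the
# label-encoded column, so the second column adds no new information.
for j in range(X.shape[1]):
    x = label_encoded[:, j]
    assert np.array_equal(one_hot[:, 2 * j + 1], x)
    assert np.array_equal(one_hot[:, 2 * j], 1 - x)
```

With 4 binary features this gives 4 input columns instead of 8, which also keeps the number of first-layer weights down when there are only 130 samples. For features with three or more unordered categories the argument no longer holds, and one-hot encoding is the usual choice.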






answered Nov 16 '18 at 14:01 by Matthieu Brucher































