Hive JSON Data Types












I want to confirm my understanding of how complex JSON data types work in Hive. I am ingesting data from a source system that stores it in MongoDB, so I receive many JSON documents, which I store in object-based storage. I then create an external Hive table, using a JSON SerDe, that points to the directory holding the JSON files.

Not every JSON document within a collection has exactly the same schema. However, to define the data types in Hive, I need to know the complete possible schema up front, correct?

It seems obvious to me that the answer is yes: because JSON is nested, you should have to describe the schema fully for Hive to make sense of the data and for things like LATERAL VIEW explode() to work. However, I am fairly new to Hive, so I want to make sure there is no feature I am unaware of that auto-detects changes in the JSON schema and updates the data types accordingly.
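To make "the complete possible schema" concrete, one way to arrive at it is to union the field types seen across a sample of documents. The following is a minimal sketch under stated assumptions: the helper names (`infer_type`, `merge`, `to_hive`), the sample documents, and the JSON-to-Hive type mapping are all hypothetical and are not part of Hive or of any SerDe. It only computes candidate column types and does not create or alter a table.

```python
import json

def infer_type(value):
    """Map a parsed JSON value to an illustrative type description.

    Primitives are Hive type names (str), structs are dicts,
    arrays are one-element lists, and None means "unknown so far".
    """
    if isinstance(value, bool):          # check bool before int: bool is an int subclass
        return "boolean"
    if isinstance(value, int):
        return "bigint"
    if isinstance(value, float):
        return "double"
    if isinstance(value, str):
        return "string"
    if isinstance(value, dict):
        return {k: infer_type(v) for k, v in value.items()}
    if isinstance(value, list):
        elem = None
        for item in value:
            elem = merge(elem, infer_type(item))
        return [elem]
    return None                          # JSON null carries no type information

def merge(a, b):
    """Union two type descriptions; conflicting types fall back to string."""
    if a is None:
        return b
    if b is None:
        return a
    if isinstance(a, dict) and isinstance(b, dict):
        merged = dict(a)
        for key, t in b.items():
            merged[key] = merge(merged.get(key), t)
        return merged
    if isinstance(a, list) and isinstance(b, list):
        return [merge(a[0], b[0])]
    if a == b:
        return a
    if isinstance(a, str) and isinstance(b, str) and {a, b} == {"bigint", "double"}:
        return "double"                  # widen integers to double
    return "string"                      # assumption: irreconcilable types become string

def to_hive(t):
    """Render a type description as a Hive type string."""
    if not t:                            # unknown type or empty struct
        return "string"
    if isinstance(t, dict):
        fields = ",".join(f"{k}:{to_hive(v)}" for k, v in sorted(t.items()))
        return f"struct<{fields}>"
    if isinstance(t, list):
        return f"array<{to_hive(t[0])}>"
    return t

# Hypothetical sample: three documents whose schemas differ.
docs = [
    '{"id": "a1", "qty": 2, "tags": ["x"]}',
    '{"id": "a2", "qty": 2.5, "address": {"city": "Oslo"}}',
    '{"id": "a3", "address": {"city": "Bergen", "zip": 5003}, "tags": []}',
]

schema = None
for line in docs:
    schema = merge(schema, infer_type(json.loads(line)))

# One candidate Hive type per top-level field.
columns = {name: to_hive(t) for name, t in schema.items()}
for name, hive_type in columns.items():
    print(f"{name} {hive_type}")
```

The resulting name and type pairs could then be pasted into the column list of a `CREATE EXTERNAL TABLE` statement by hand. A sample only reveals fields that actually occur in it, so this approach still depends on knowing, or scanning, enough documents to cover every variant.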










      json hive hive-serde






      edited Nov 15 '18 at 17:54









      dtolnay











      asked Nov 15 '18 at 11:37









      Nibroc A Rehpotsirhc
























