How to create seperate tables from postgreSQL JSON column value












0















Following is the table format :



CREATE TABLE records
(
id text,
record json
)


For every record column of the row , JSON value will be: (this is just a snapshot of an entire structure)



{
"Trial": {
"primary_id": "xxxxx",
"key1": "aaaaaaaa (BAL-8557) aaaaaaaaaaaaaaaaaa",
"key2": "aaaaaaaaaaaaa , aaaaaaaaaaaaaaaaaaa",
"key3": "Yes",
"key4": "No",
"key5": {
"key5k": [{
"@type": "ABC",
"$": "ABC121 "
},
{
"@type": "ABC",
"$": "ABC12133 "
}
]
},
"Indications": {
"Indication": [{
"indication_id": "1308",
"$": "aaaaaa"
},
{
"indicationid": "1309",
"$": "bbbbbb"
}
]
}
}
}


there are around 300K rows in the table currently



I want to develop an utility that will read all rows find keys which have nested JSON.



Based on the nested JSON structure I want to create a separate table



E.g. from above JSON table , Indication Table will be



----------------------------------------
primary_id | indication_id | value
----------------------------------------
xxxxx | 1308 | aaaaaa
----------------------------------------
xxxxx | 1309 | bbbbbb




How can I develop this utility ?



Should this be handled by PostgreSQL functions or using Python + Spark ?



If someone could share any link for this solution or something similar that will be helpful , thanks










share|improve this question























  • So what have you tried so far? Where are you stuck?

    – Richard Huxton
    Nov 15 '18 at 9:10
















0















Following is the table format :



CREATE TABLE records
(
id text,
record json
)


For every record column of the row , JSON value will be: (this is just a snapshot of an entire structure)



{
"Trial": {
"primary_id": "xxxxx",
"key1": "aaaaaaaa (BAL-8557) aaaaaaaaaaaaaaaaaa",
"key2": "aaaaaaaaaaaaa , aaaaaaaaaaaaaaaaaaa",
"key3": "Yes",
"key4": "No",
"key5": {
"key5k": [{
"@type": "ABC",
"$": "ABC121 "
},
{
"@type": "ABC",
"$": "ABC12133 "
}
]
},
"Indications": {
"Indication": [{
"indication_id": "1308",
"$": "aaaaaa"
},
{
"indicationid": "1309",
"$": "bbbbbb"
}
]
}
}
}


there are around 300K rows in the table currently



I want to develop an utility that will read all rows find keys which have nested JSON.



Based on the nested JSON structure I want to create a separate table



E.g. from above JSON table , Indication Table will be



----------------------------------------
primary_id | indication_id | value
----------------------------------------
xxxxx | 1308 | aaaaaa
----------------------------------------
xxxxx | 1309 | bbbbbb




How can I develop this utility ?



Should this be handled by PostgreSQL functions or using Python + Spark ?



If someone could share any link for this solution or something similar that will be helpful , thanks










share|improve this question























  • So what have you tried so far? Where are you stuck?

    – Richard Huxton
    Nov 15 '18 at 9:10














0












0








0








Following is the table format :



CREATE TABLE records
(
id text,
record json
)


For every record column of the row , JSON value will be: (this is just a snapshot of an entire structure)



{
"Trial": {
"primary_id": "xxxxx",
"key1": "aaaaaaaa (BAL-8557) aaaaaaaaaaaaaaaaaa",
"key2": "aaaaaaaaaaaaa , aaaaaaaaaaaaaaaaaaa",
"key3": "Yes",
"key4": "No",
"key5": {
"key5k": [{
"@type": "ABC",
"$": "ABC121 "
},
{
"@type": "ABC",
"$": "ABC12133 "
}
]
},
"Indications": {
"Indication": [{
"indication_id": "1308",
"$": "aaaaaa"
},
{
"indicationid": "1309",
"$": "bbbbbb"
}
]
}
}
}


there are around 300K rows in the table currently



I want to develop an utility that will read all rows find keys which have nested JSON.



Based on the nested JSON structure I want to create a separate table



E.g. from above JSON table , Indication Table will be



----------------------------------------
primary_id | indication_id | value
----------------------------------------
xxxxx | 1308 | aaaaaa
----------------------------------------
xxxxx | 1309 | bbbbbb




How can I develop this utility ?



Should this be handled by PostgreSQL functions or using Python + Spark ?



If someone could share any link for this solution or something similar that will be helpful , thanks










share|improve this question














Following is the table format :



CREATE TABLE records
(
id text,
record json
)


For every record column of the row , JSON value will be: (this is just a snapshot of an entire structure)



{
"Trial": {
"primary_id": "xxxxx",
"key1": "aaaaaaaa (BAL-8557) aaaaaaaaaaaaaaaaaa",
"key2": "aaaaaaaaaaaaa , aaaaaaaaaaaaaaaaaaa",
"key3": "Yes",
"key4": "No",
"key5": {
"key5k": [{
"@type": "ABC",
"$": "ABC121 "
},
{
"@type": "ABC",
"$": "ABC12133 "
}
]
},
"Indications": {
"Indication": [{
"indication_id": "1308",
"$": "aaaaaa"
},
{
"indicationid": "1309",
"$": "bbbbbb"
}
]
}
}
}


there are around 300K rows in the table currently



I want to develop an utility that will read all rows find keys which have nested JSON.



Based on the nested JSON structure I want to create a separate table



E.g. from above JSON table , Indication Table will be



----------------------------------------
primary_id | indication_id | value
----------------------------------------
xxxxx | 1308 | aaaaaa
----------------------------------------
xxxxx | 1309 | bbbbbb




How can I develop this utility ?



Should this be handled by PostgreSQL functions or using Python + Spark ?



If someone could share any link for this solution or something similar that will be helpful , thanks







python json postgresql pyspark






share|improve this question













share|improve this question











share|improve this question




share|improve this question










asked Nov 15 '18 at 3:58









K.PilK.Pil

10219




10219













  • So what have you tried so far? Where are you stuck?

    – Richard Huxton
    Nov 15 '18 at 9:10



















  • So what have you tried so far? Where are you stuck?

    – Richard Huxton
    Nov 15 '18 at 9:10

















So what have you tried so far? Where are you stuck?

– Richard Huxton
Nov 15 '18 at 9:10





So what have you tried so far? Where are you stuck?

– Richard Huxton
Nov 15 '18 at 9:10












0






active

oldest

votes











Your Answer






StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});


}
});














draft saved

draft discarded


















StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53312232%2fhow-to-create-seperate-tables-from-postgresql-json-column-value%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown

























0






active

oldest

votes








0






active

oldest

votes









active

oldest

votes






active

oldest

votes
















draft saved

draft discarded




















































Thanks for contributing an answer to Stack Overflow!


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53312232%2fhow-to-create-seperate-tables-from-postgresql-json-column-value%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Xamarin.iOS Cant Deploy on Iphone

Glorious Revolution

Dulmage-Mendelsohn matrix decomposition in Python