Microsoft Fabric Community Conference 2025, March 31 - April 2, Las Vegas, Nevada. Use code MSCUST for a $150 discount.
Register nowThe Power BI DataViz World Championships are on! With four chances to enter, you could win a spot in the LIVE Grand Finale in Las Vegas. Show off your skills.
Greetings group
I have a great concern that I have not been able to resolve. I happen to have a little over 20 GB worth of json. I need to upload them to Power BI to start analyzing their information. It takes an absurd amount of time, and in the end an error comes out.
I want to know what platform you recommend to upload these json, I have them local, but I have tried with sharepoint as well. Or what recommendations do you give me so that this charge does not take almost 14 hours.
Is it faster if I partition more, the json weigh on average 100 megabytes or approximately 1 million rows?
I don't have the $9.99 to buy the license and make a flow in power bi online and feed the dashboard from there
I will be grateful for your help
this is my code whit
let
Origen = SharePoint.Files("", [ApiVersion = 15]),
#"Filas filtradas" = Table.SelectRows(Origen, each Text.Contains([Folder Path], "Archivos Metabase")),
#"Filas filtradas 1" = Table.SelectRows(#"Filas filtradas", each [Folder Path] = "),
#"Otras columnas quitadas" = Table.SelectColumns(#"Filas filtradas 1",{"Content"}),
#"Personalizada agregada" = Table.AddColumn(#"Otras columnas quitadas", "Personalizado", each Json.Document([Content])),
#"Se expandió Personalizado" = Table.ExpandListColumn(#"Personalizada agregada", "Personalizado"),
#"Se expandió Personalizado1" = Table.ExpandRecordColumn(#"Se expandió Personalizado", "Personalizado", {"CodCliente", "Info3", "Status", "FechaMod", "Info6", "Info5", "Info1", "CodCampanna", "Info2", "Info4", "ContadorLlamadas", "Telefono3", "Telefono1", "NombreCliente"}, {"CodCliente", "Info3", "Status", "FechaMod", "Info6", "Info5", "Info1", "CodCampanna", "Info2", "Info4", "ContadorLlamadas", "Telefono3", "Telefono1", "NombreCliente"}),
#"Columnas quitadas" = Table.RemoveColumns(#"Se expandió Personalizado1",{"Content"}),
#"Dividir columna por delimitador" = Table.SplitColumn(#"Columnas quitadas", "FechaMod", Splitter.SplitTextByDelimiter("T", QuoteStyle.Csv), {"Fecha", "Hora"}),
#"Tipo cambiado" = Table.TransformColumnTypes(#"Dividir columna por delimitador",{{"Fecha", type date}}),
#"Valor reemplazado" = Table.ReplaceValue(#"Tipo cambiado","Z","",Replacer.ReplaceText,{"Hora"}),
#"Tipo cambiado1" = Table.TransformColumnTypes(#"Valor reemplazado",{{"Hora", type time}}),
#"Columnas con nombre cambiado" = Table.RenameColumns(#"Tipo cambiado1",{{"CodCampanna", "cod_campanna"}, {"CodCliente", "cod_cliente"}, {"Telefono1", "telefono"}, {"Fecha", "fecha_llamada"}, {"Hora", "hora_llamada"}, {"Info1", "resultado_info1"}, {"Info2", "resultado_info2"}, {"Info3", "resultado_info3"}, {"Info4", "resultado_info4"}, {"Info5", "resultado_info5"}, {"Info6", "resultado_info6"}, {"Status", "status"}, {"NombreCliente", "nombre_cliente"}})
in
#"Columnas con nombre cambiado"
You need to put the data in a database or datalake and access it from there. If your org doesn't want to spend $9.99 to do things with their data, then that'll be an issue. If you're doing this for personal funsies, put it in a local database or something and connect to it there?
User | Count |
---|---|
122 | |
69 | |
67 | |
58 | |
52 |
User | Count |
---|---|
189 | |
96 | |
67 | |
63 | |
53 |