Telemetry/Custom analysis with spark: Difference between revisions

(update moztelemetry documentation link, add FAQ)
Line 43: Line 43:


=== How can I load parquet datasets in a Jupyter notebook? ===
=== How can I load parquet datasets in a Jupyter notebook? ===
Use  
Use sqlContext.read.load, e.g.:
  sqlContext.read.load
e.g.:


   dataset = sqlContext.read.load("s3://telemetry-parquet/e10s_experiment_view/e10s_beta48_cohorts/v20160720_20160727", "parquet")
   dataset = sqlContext.read.load("s3://the_bucket/the_prefix/the_version", "parquet")
29

edits