Changes

Jump to: navigation, search

Telemetry/Custom analysis with spark

406 bytes added, 17:51, 11 August 2016
Added FAQ
dataset = sqlContext.read.load("s3://the_bucket/the_prefix/the_version", "parquet")
 
=== I got a REMOTE HOST IDENTIFICATION HAS CHANGED! error ===
AWS recycles hostnames, so removing the offending key from $HOME/.ssh/known_hosts will remove the warning. You can find the line to remove by finding the line in the output that says
Offending key in /path/to/hosts/known_hosts:2
Where 2 is the line number of the key that can be deleted. Just remove that line, save the file, and try again.
29
edits

Navigation menu