This pull request makes newer Spark versions with yarn use Hadoop 2.7. #105

lagerspetz · 2017-06-21T11:03:26Z

This pull request makes newer Spark versions with yarn use Hadoop 2.7. The purpose is to make it possible to use the s3a filesystem scheme with spark-ec2 Amazon EC2 deployments.

shivaram · 2017-06-21T18:50:11Z

But doesn't this require the s3a jars to also be downloaded ? Also maybe we can add a new option for this rather than editing the existing one ?

lagerspetz · 2017-06-21T19:08:53Z

I'm not sure if s3a jars need to be in Spark's jars or if it is enough to have them as dependencies of a project wishing to use those URLs. In Carat, we use S3 directly so we have those jars already as dependencies.

In current Spark, s3n urls work; what would we need to make also s3a urls work out of the box?
According to this it should already be in Hadoop 2.7+
https://wiki.apache.org/hadoop/AmazonS3

nchammas · 2017-06-21T22:09:06Z

To use S3A you do need to pull in additional packages. For reference, this is how Flintrock does it:

lagerspetz · 2017-07-13T09:21:49Z

Actually I think pull #56 does this better.

Use Hadoop 2.7 for newer Spark versions with yarn.

4e1a916

lagerspetz force-pushed the use-hadoop-27 branch from 008ee4b to 4e1a916 Compare June 21, 2017 11:09

lagerspetz closed this Jul 13, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

This pull request makes newer Spark versions with yarn use Hadoop 2.7. #105

This pull request makes newer Spark versions with yarn use Hadoop 2.7. #105

lagerspetz commented Jun 21, 2017

shivaram commented Jun 21, 2017

lagerspetz commented Jun 21, 2017 •

edited

Loading

nchammas commented Jun 21, 2017

lagerspetz commented Jul 13, 2017

This pull request makes newer Spark versions with yarn use Hadoop 2.7. #105

This pull request makes newer Spark versions with yarn use Hadoop 2.7. #105

Conversation

lagerspetz commented Jun 21, 2017

shivaram commented Jun 21, 2017

lagerspetz commented Jun 21, 2017 • edited Loading

nchammas commented Jun 21, 2017

lagerspetz commented Jul 13, 2017

lagerspetz commented Jun 21, 2017 •

edited

Loading