Partition

All Posts

Partition pruning

0 Answers

0 Votes

2 Views

published by nhufas 7 hours ago
parquet·partition

Creating equal-size batches in Spark

0 Answers

0 Votes

89 Views

published by sd.hrishi on Mar 21, '18
spark·spark-sql·rdd·partition·batch-learning

Spark 2.0 Streaming breaking partition-by partition executor assignment

1 Answer

1 Vote

237 Views

answered by brandonsc on Nov 23, '17
spark streaming·spark 2.0·partition·hashpartitioning·partition-by

Are there any best practices for working with the same expensive join repeatedly?

1 Answer

0 Votes

1.3k Views

answered by madhav on Aug 9, '17
cache·partition·query·broadcasthashjoin·persist

Persisted RDD partitions are getting cleared automatically. How can I determine whether a persisted RDD is still in memory and/or on disk?

-2 Answers

0 Votes

1.4k Views

answered by 4gs on May 11, '17
partitioning·cache·partition·persistence·persist

Mapping a Kafka partition to a specific Spark executor

0 Answers

1 Vote

410 Views

published by Sami Ouassaidi on Mar 16, '17
kafka·map·partition·executor·spark kafka

Why does the foreachPartition function make duplicate invocations of the map function for every message? (Spark 2.0.2)

0 Answers

0 Votes

761 Views

commented by srirocky on Feb 24, '17
spark streaming·api·partition·foreachpartition·spark data stream

Spark Performance Issue (Partition too small?)

1 Answer

0 Votes

1.2k Views

answered by jason on Jun 24, '16
spark·performance·partition

Not able to partition table on Hive - Error in metadata

1 Answer

0 Votes

2.1k Views

answered by Arunkumar on Dec 4, '15
hive·hadoop·partitioning·partition

Is it possible to use a partition of data (different files) in Python as is shown in Scala?

1 Answer

0 Votes

668 Views

answered by cfregly on Apr 12, '15
partition

How many partitions should I use for a particular dataset?

1 Answer

0 Votes

1.1k Views

answered by cfregly on Mar 21, '15
parallelism·partition·core
23 Posts
16 Users
0 Followers

Topic Experts

There are no topic experts for this topic. Participate in the posts in this topic to earn reputation and become an expert.

Related Topics

partitioning·spark·spark streaming·persist·cache·parallelism·hive·kafka·core·map·s3·persistence·spark 2.0·sql·performance·partition-by·partitions·hashpartitioning·table·spark-sql·query·spark data stream·rdd·api·batch-learning
Databricks Inc.
© Databricks 2015. All rights reserved. Apache Spark and the Apache Spark Logo are trademarks of the Apache Software Foundation.