PinnedHow to lower job costs on EC2 instances and improve performance when using Spark’s persist methodIn this blog, we’ll examine Spark’s persist method and explain how to use it in a performant manner and lower job costs.Mar 27, 2024Mar 27, 2024
PinnedHow to improve performance and lower Spark job costs on EC2 instances when using Spark’s coalesce…Mar 27, 2024Mar 27, 2024
Published inExpedia Group TechnologyPart 6: Summary of Apache Spark Cost Tuning StrategyThe step by step overview of the cost tuning strategyAug 20, 2020Aug 20, 2020
Published inExpedia Group TechnologyPart 5: How to Resolve Common Errors When Switching to Cost Efficient Apache Spark Executor…How to resolve memory issues that happen when switching to efficient executor configsAug 18, 2020Aug 18, 2020
Published inExpedia Group TechnologyPart 4: How to Migrate Existing Apache Spark Jobs to Cost Efficient Executor ConfigurationsSteps to follow when converting existing jobs to cost efficient configAug 13, 2020A response icon4Aug 13, 2020A response icon4
Published inExpedia Group TechnologyPart 3: Cost Efficient Executor Configuration for Apache SparkFind the most efficient executor configuration for your nodeAug 11, 2020A response icon5Aug 11, 2020A response icon5
Published inExpedia Group TechnologyPart 2: Real World Apache Spark Cost Tuning ExamplesI outline the procedure for working through cost tuningAug 6, 2020A response icon1Aug 6, 2020A response icon1
Published inExpedia Group TechnologyPart 1: Cloud Spending Efficiency Guide for Apache Spark on EC2 InstancesHow I saved 60% of costs in an Apache Spark job, with no increase in job time and no decrease in data processedAug 4, 2020A response icon2Aug 4, 2020A response icon2
Published inHomeAway Tech BlogAre You Sure You Have Good Data?Best practices for detecting bad data before it spreadsApr 3, 2019Apr 3, 2019