Details for this torrent 

Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)
Type:
Other > Other
Files:
898
Size:
9.48 GiB (10181696515 Bytes)
Uploaded:
2022-01-27 13:11:53 GMT
By:
tuts756 Trusted
Seeders:
4
Leechers:
3
Comments
0  

Info Hash:
ECC71C1CB0429A5F1314E9C86925DEC7041C98D8




(Problems with magnets links are fixed by upgrading your torrent client!)
Description

As part of this course, you will learn all the key skills to build Data Engineering Pipelines using Spark SQL and Spark Data Frame APIs using Python as a Programming language. This course used to be a CCA 175 Spark and Hadoop Developer course for the preparation of the Certification Exam. As of 10/31/2021, the exam is sunset and we have renamed it to Apache Spark 2 and 3 using Python 3 as it covers industry-relevant topics beyond the scope of certification.

About Data Engineering

Data Engineering is nothing but processing the data depending upon our downstream needs. We need to build different pipelines such as Batch Pipelines, Streaming Pipelines, etc as part of Data Engineering. All roles related to Data Processing are consolidated under Data Engineering. Conventionally, they are known as ETL Development, Data Warehouse Development, etc. Apache Spark is evolved as a leading technology to take care of Data Engineering at scale.

I have prepared this course for anyone who would like to transition into a Data Engineer role using Pyspark (Python + Spark). I myself am a proven Data Engineering Solution Architect with proven experience in designing solutions using Apache Spark.

Let us go through the details about what you will be learning in this course. Keep in mind that the course is created with a lot of hands-on tasks which will give you enough practice using the right tools. Also, there are tons of tasks and exercises to evaluate yourself.

Setup of Single Node Big Data Cluster

Many of you would like to transition to Big Data from Conventional Technologies such as Mainframes, Oracle PL/SQL, etc and you might not have access to Big Data Clusters. It is very important for you set up the environment in the right manner. Don’t worry if you do not have the cluster handy, we will guide you through with support via Udemy Q&A.

    Setup Ubuntu based AWS Cloud9 Instance with right configuration
    Ensure Docker is setup
    Setup Jupyter Lab and other key components
    Setup and Validate Hadoop, Hive, YARN and Spark

A quick recap of Python

This course requires a decent knowledge of Python. To make sure you understand Spark from a Data Engineering perspective, we added a module to quickly warm up with Python. If you are not familiar about Python, then we suggest you to go through our other course Data Engineering Essentials – Python, SQL and Spark.

Data Engineering using Spark SQL

Let us, deep-dive into Spark SQL to understand how it can be used to build Data Engineering Pipelines. Spark with SQL will provide us the ability to leverage distributed computing capabilities of Spark coupled with easy-to-use developer-friendly SQL-style syntax.

    Getting Started with Spark SQL
    Basic Transformations using Spark SQL
    Managing Spark Metastore Tables – Basic DDL and DML
    Managing Spark Metastore Tables Tables – DML and Partitioning
    Overview of Spark SQL Functions
    Windowing Functions using Spark SQL

Data Engineering using Spark Data Frame APIs

Spark Data Frame APIs are an alternative way of building Data Engineering applications at scale leveraging distributed computing capabilities of Spark. Data Engineers from application development backgrounds might prefer Data Frame APIs over Spark SQL to build Data Engineering applications.

    Data Processing Overview using Spark Data Frame APIs
    Processing Column Data using Spark Data Frame APIs
    Basic Transformations using Spark Data Frame APIs – Filtering, Aggregations, and Sorting
    Joining Data Sets using Spark Data Frame APIs
    Windowing Functions using Spark Data Frame APIs – Aggregations, Ranking, and Analytic Functions
    Spark Metastore Databases and Tables

Apache Spark Application Development and Deployment Life Cycle

As Apache Spark based Data Engineers we should be familiar about Application Development and Deployment Lifecycle. As part of this section you will learn the complete life cycle of Development and Deployment Life cycle. It includes but not limited to productionizing the code, externalizing the properties, reviewing the details of Spark Jobs and many more.

    Apache Spark Application Development Lifecycle
    Spark Application Execution Life Cycle and Spark UI
    Setup SSH Proxy to access Spark Application logs
    Deployment Modes of Spark Applications
    Passing Application Properties Files and External Dependencies

All the demos are given on our state of the art Big Data cluster. You can avail one-month complimentary lab access by reaching out to [email protected] with Udemy receipt.
Who this course is for:

    Any IT aspirant/professional willing to learn Data Engineering using Apache Spark
    Python Developers who want to learn Spark to add the key skill to be a Data Engineer

Requirements

    Basic programming skills using any programming language
    Self support lab (Instructions provided) or ITVersity lab at additional cost for appropriate environment.
    Minimum memory required based on the environment you are using with 64 bit operating system
    4 GB RAM with access to proper clusters or 16 GB RAM with virtual machines such as Cloudera QuickStart VM

Last Updated 1/2022

[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/009 Apache Spark - Framework.mp4200.34 MiB
TutsNode.com.txt63 B
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/009 Apache Spark - Framework_it.vtt32.76 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/010 Create Data Frames from Text Files_it.vtt21.45 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/01 - Introduction/external-assets-links.txt146 B
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/005 Date Manipulation Functions_it.vtt20.53 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/008 Develop Simple Application_it.vtt19.43 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/007 Spark Data Structures - RDDs and Data Frames_it.vtt19.3 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/013 Submit Spark Applications using SPARK_CONF_DIR with custom properties files_it.vtt18.11 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/005 Quick Review of Spark APIs_it.vtt17.8 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/010 Total Aggregations_it.vtt17.42 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/004 Copying files from local to HDFS_it.vtt16.94 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/02 - Setting up Environment using AWS Cloud9/004 Overview of EC2 related to Cloud9_it.vtt2.23 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/014 Overview of Sorting Data Frames_it.vtt16.71 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/002 Default Execution Master Type for Spark Applications_it.vtt16.07 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/003 Managing HDFS Directories_it.vtt16.06 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/009 Develop Word Count Application_it.vtt15.76 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/003 Creating Metastore Tables using catalog_it.vtt15.68 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/010 Changing Spark Application Log Level using custom log4j properties_it.vtt15.65 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/007 Ranking using Windowing Functions_it.vtt15.37 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/005 Boolean Operators_it.vtt14.96 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/02 - Setting up Environment using AWS Cloud9/009 [Commands] Setup Jupyter Lab on Cloud9.html622 B
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/008 Setup Data Set for Word Count application_it.vtt14.91 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/009 Aggregating Data_it.vtt14.9 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/004 Running Spark Applications using Local Mode_it.vtt14.83 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/009 Pass Application Properties to Spark Application using HDFS files_it.vtt14.82 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/004 String Manipulation Functions_it.vtt14.73 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/006 Left or Right Outer Join_it.vtt14.68 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/15 - Apache Spark using Python - Data Processing Overview/002 Overview of Spark Read APIs_it.vtt14.66 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/003 Launch Pyspark using local mode_it.vtt14.65 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/15 - Apache Spark using Python - Data Processing Overview/008 Overview of Spark Write APIs_it.vtt14.64 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/014 Submit Spark Applications using Properties File_it.vtt14.57 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/03 - Setting up Environment - Overview of GCP and Provision Ubuntu VM/007 Setup Jupyter Lab_it.vtt14.5 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/004 Aggregations using Windowing Functions_it.vtt14.42 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/006 Filtering Data_it.vtt14 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/06 - Python Fundamentals/007 Basic IO operations and processing data using Collections_it.vtt13.92 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/03 - Setting up Environment - Overview of GCP and Provision Ubuntu VM/external-assets-links.txt859 B
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/002 Create Spark Metastore Tables_it.vtt13.87 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/019 Validate Jupyter Lab Setup_it.vtt13.84 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/005 Using LEAD or LAG_it.vtt13.78 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/011 Configure Passwordless Login using PuTTYGen Keys on Windows_it.vtt13.7 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/003 [Commands] - Setup Prerequisties.html474 B
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/006 Accessing Local Files using Spark CLI or Spark Applications_it.vtt13.47 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/007 Managing Spark Metastore Databases_it.vtt13.32 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/005 [Commands] - Setup Password less login.html95 B
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/008 Create Partitioned Tables_it.vtt13.26 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/011 Using Spark SQL_it.vtt13.09 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/007 [Commands] - Download and Install Hadoop.html1.07 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/015 Detailed Validation of the Application_it.vtt13.05 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/011 Aggregate data using groupBy_it.vtt12.92 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/009 [Commands] - Configure Hadoop HDFS.html2.47 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/014 Running the application using YARN_it.vtt12.89 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/external-assets-links.txt1.79 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/011 [Commands] - Start and Validate HDFS.html1.13 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/002 Overview of Private and Public ips of servers in the cluster_it.vtt12.82 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/012 Configure Hadoop YARN_it.vtt1.77 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/013 [Commands] - Configure Hadoop YARN.html1.33 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/017 Overview of Spark History Server UI_it.vtt12.55 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/014 Start and Validate YARN_it.vtt2.92 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/015 [Commands] - Start and Validate YARN.html645 B
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/012 HDFS File Permissions_it.vtt12.54 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/006 Extracting Strings using substring_it.vtt12.52 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/017 [Commands] - Managing Single Node Hadoop.html1.11 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/06 - Python Fundamentals/004 Python Collections_it.vtt12.48 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/004 Filtering Example using dates_it.vtt12.47 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/002 [Commands] - Setup Data Sets for Practice.html1.1 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/003 Overview of Data Types_it.vtt12.46 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/015 Configure Spark 2_it.vtt12.4 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/004 [Commands] - Download and Install Hive.html1.45 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/005 Overview of Spark CLI Commands such as Pyspark_it.vtt12.36 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/010 Quick Tour of PuTTY on Windows_it.vtt12.33 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/006 [Commands] - Setup Database for Hive Metastore.html2.63 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/014 Using to_date and to_timestamp_it.vtt12.22 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/002 Analyze Datasets for Joins_it.vtt12.18 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/008 [Commands] - Configure and Setup Hive Metastore.html2 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/004 Overview of Spark SQL Properties_it.vtt12.09 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/002 Review of Setup Steps for Spark Environment_it.vtt12.04 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/010 [Commands] - Launch and Validate Hive.html694 B
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/008 Read data from files_it.vtt11.69 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/017 Validate Spark 2 using CLIs_it.vtt11.62 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/012 [Commands] - Scripts to Manage Single Node Cluster.html1.03 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/005 Using Inner Joins_it.vtt11.6 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/006 Overview of Numeric Functions_it.vtt11.56 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/014 [Commands] - Download and Install Spark 2.html1.31 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/06 - Python Fundamentals/003 Functions in Python_it.vtt11.55 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/027 Validate Spark 3 using CLIs_it.vtt11.54 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/016 [Commands] - Configure Spark 2.html2.1 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/011 Exercise - Getting Started with Spark SQL_it.vtt11.4 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/03 - Setting up Environment - Overview of GCP and Provision Ubuntu VM/004 Provision Ubuntu VM from GCP_it.vtt11.37 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/018 [Commands] - Validate Spark 2 using CLIs.html1.41 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/03 - Setting up Environment - Overview of GCP and Provision Ubuntu VM/005 Setup Docker_it.vtt11.33 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/005 Understanding YARN Capacity of an Enterprise Cluster_it.vtt11.33 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/020 [Commands] - Validate Jupyter Lab Setup.html1.04 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/36816926-airport-codes-na.txt11.14 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/06 - Python Fundamentals/005 Map Reduce operations on Python Collections_it.vtt11.14 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/022 [Commands] - Integrate Spark 2 with Jupyter Lab.html1.86 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/003 Basic Filtering of Data_it.vtt11.13 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/023 Download and Install Spark 3_it.vtt2.65 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/024 [Commands] - Download and Install Spark 3.html1.32 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/015 Develop Shell Script to run Spark Application_it.vtt11.08 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/016 Update Hosts file for worker nodes using private ips_it.vtt10.81 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/026 [Commands] - Configure Spark 3.html2.29 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/003 Overview of SSH Proxy_it.vtt10.8 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/007 Extracting Strings using split_it.vtt10.77 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/028 [Commands] - Validate Spark 3 using CLIs.html1.41 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/06 - Python Fundamentals/002 Basic Programming Constructs_it.vtt10.69 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/015 Solution - Problem 1 - Get Total Aggregations_it.vtt10.67 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/030 [Commands] - Integrate Spark 3 with Jupyter Lab.html1.87 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/008 Dealing with Nulls_it.vtt10.62 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/012 Aggregate data using rollup_it.vtt10.58 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/001 Overview of Predefined Functions in Spark_it.vtt10.54 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/006 Getting File Metadata_it.vtt10.5 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/005 Setup Database for Hive Metastore_it.vtt10.43 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/02 - Setting up Environment using AWS Cloud9/008 Setup Jupyter Lab on Cloud9_it.vtt10.37 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/012 Submit Spark Application using cluster mode with log level info_it.vtt10.37 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/009 HDFS Replication Factor_it.vtt10.37 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/007 Configure and Setup Hive Metastore_it.vtt10.33 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/011 Validating Writing Data to Files_it.vtt10.33 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/004 Inferring Schema for Tables_it.vtt10.28 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/008 Joining Tables - Outer_it.vtt10.27 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/015 Understanding Proxying Network Traffic overcoming Windows Caveats_it.vtt10.22 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/018 Using CASE and WHEN_it.vtt10.18 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/06 - Python Fundamentals/external-assets-links.txt931 B
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/010 Write data to files_it.vtt10.15 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/15 - Apache Spark using Python - Data Processing Overview/001 Starting Spark Context - pyspark_it.vtt10.14 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/010 Overview of File Formats_it.vtt9.92 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/004 Apache Spark Official Documentation (Very Important)_it.vtt9.9 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/008 Configure Hadoop HDFS_it.vtt9.89 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/008 Pass Application Properties to Spark Application using local files in cluster mo_it.vtt9.88 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/15 - Apache Spark using Python - Data Processing Overview/006 Overview of Data Frame APIs_it.vtt9.8 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/003 Using ITVersity labs_it.vtt9.77 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/005 Define Schema for Tables using StructType_it.vtt9.71 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/008 Loading Data into Partitioned Tables_it.vtt9.7 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/013 Aggregate data using cube.cmproj_it.vtt9.61 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/009 Using CASE and WHEN_it.vtt9.6 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/001 Overview of Spark Metastore_it.vtt9.55 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/007 Joining Tables - Inner_it.vtt9.53 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/014 Setup Proxy on Windows and validate using Microsoft Edge browser_it.vtt9.53 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/06 - Python Fundamentals/001 Introduction and Setting up Python_it.vtt9.51 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/006 Getting first and last values_it.vtt9.49 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/002 Spark SQL - Overview_it.vtt9.48 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/025 Configure Spark 3_it.vtt9.42 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/15 - Apache Spark using Python - Data Processing Overview/005 Previewing Airlines Data_it.vtt9.27 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/002 Steps to pass application properties using JSON_it.vtt9.24 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/011 Using HDFS Stat Commands_it.vtt2.08 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/006 Inserting into Existing Tables_it.vtt9.21 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/013 Exercise - Managed Tables_it.vtt9.19 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/004 Special Functions - col and lit_it.vtt9.16 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/010 Query Example - Word Count_it.vtt9.15 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/external-assets-links.txt1.43 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/15 - Apache Spark using Python - Data Processing Overview/004 Inferring Schema_it.vtt9.09 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/007 Pass Application Properties to Spark Application using local files in client mod_it.vtt9.05 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/003 Setup EMR Cluster with Hadoop and Spark_it.vtt9.03 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/010 Start and Validate HDFS_it.vtt9.02 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/009 Process data using Spark APIs_it.vtt9.01 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/008 Order of execution of SQL_it.vtt8.91 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/016 Dealing with Unix Timestamp_it.vtt8.79 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/015 Using date_format Function_it.vtt8.78 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/021 Integrate Spark 2 with Jupyter Lab_it.vtt8.74 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/017 Dealing with Nulls_it.vtt8.73 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/001 Prepare Datasets for Joins_it.vtt8.67 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/013 Overriding Properties_it.vtt8.52 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/007 Previewing Data in HDFS File_it.vtt8.5 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/01 - Introduction/001 CCA 175 Spark and Hadoop Developer - Curriculum_it.vtt8.49 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/03 - Setting up Environment - Overview of GCP and Provision Ubuntu VM/003 Overview of GCP Pricing_it.vtt8.48 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/006 Using IN Operator or isin Function_it.vtt8.43 KiB
[TGx]Downloaded from torrentgalaxy.to .txt585 B
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/003 Passing Run Time Arguments_it.vtt8.43 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/007 Read and Process data from Metastore Tables_it.vtt8.33 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/011 Create Data Frames from Hive Tables_it.vtt8.12 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/017 Pass External Dependencies to Spark Application using HDFS files_it.vtt8.08 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/011 Submit Spark Application using client mode with log level info_it.vtt7.95 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/external-assets-links.txt2.06 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/013 Explore APIs to process YAML Data using Pyspark_it.vtt7.94 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/001 Getting Started - Overview_it.vtt2.69 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/012 Using Date and Time Trunc Functions_it.vtt7.9 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/013 Switch between Pyspark Versions to run Spark Applications or launch Pyspark CLI_it.vtt7.9 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/003 Problem Statements for Joins_it.vtt2.51 KiB
.pad/03 B
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/06 - Python Fundamentals/007 Basic IO operations and processing data using Collections.mp4184.76 MiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/006 Loading Data Into Tables - HDFS_it.vtt7.86 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/006 Create Function for Spark Session_it.vtt7.78 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/010 Filtering Window Function Results_it.vtt7.75 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/005 Copying files from HDFS to local_it.vtt7.74 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/008 Padding Characters around Strings_it.vtt7.72 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/009 Dealing with Nulls while Filtering_it.vtt7.71 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/008 Using BETWEEN Operator_it.vtt7.66 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/004 Overall Capacity of Big Data Cluster with Hadoop and Spark_it.vtt7.64 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/002 Create Dummy Data Frame_it.vtt7.62 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/015 Pass External Dependencies to Spark Application using local files in client mode_it.vtt7.6 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/002 Getting Started with Pycharm_it.vtt7.59 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/007 Using LIKE Operator or like Function_it.vtt7.58 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/016 Pass External Dependencies to Spark Apps using local files in cluster mode_it.vtt7.57 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/002 Setup Prerequisties_it.vtt7.54 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/009 Saving as Partitioned Table_it.vtt7.49 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/external-assets-links.txt1.47 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/017 Access Spark Application logs using SSH Proxy_it.vtt7.43 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/006 Spark Modules_it.vtt7.37 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/009 Trimming Characters from Strings_it.vtt7.34 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/009 Review the default logging while submitting Spark Applications_it.vtt7.34 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/011 Date and Time Arithmetic_it.vtt7.32 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/005 Proxy using sshuttle on Mac or Linux_it.vtt7.3 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/009 Launch and Validate Hive_it.vtt7.17 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/006 Accessing Spark Application logs via SSH Proxy using sshuttle on Mac or Linux_it.vtt7.14 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/004 Prepare Tables_it.vtt6.92 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/011 Scripts to Manage Single Node Cluster_it.vtt6.91 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/012 Productionizing the Code_it.vtt6.9 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/014 Solution - Get Daily Revenue rolled up till Yearly_it.vtt6.9 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/15 - Apache Spark using Python - Data Processing Overview/007 Overview of Functions_it.vtt6.88 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/02 - Setting up Environment using AWS Cloud9/002 Creating Cloud9 Environment_it.vtt6.82 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/010 Creating Temporary Views_it.vtt6.73 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/001 Setup Virtual Environment and Install Pyspark_it.vtt6.72 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/001 Setup Data Sets for Practice_it.vtt6.72 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/010 Role of Spark Metastore or Hive Metastore_it.vtt6.7 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/009 Solution - Get Dormant US Airports_it.vtt5.2 KiB
.pad/13 B
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/007 Spark Data Structures - RDDs and Data Frames.mp4133.34 MiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/008 HDFS Block Size_it.vtt6.69 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/external-assets-links.txt1.25 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/001 Passing Application Properties Files and External Dependencies - Introduction_it.vtt6.65 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/010 Solution - Get Origins without master data_it.vtt6.62 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/012 Install PyYAML into specific folder and build zip_it.vtt6.59 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/006 Overview of Hadoop HDFS and YARN Setup on Multi-node Cluster_it.vtt6.54 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/006 Download and Install Hadoop_it.vtt6.47 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/010 Sorting Data_it.vtt6.47 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/010 Using Dynamic Partition Mode_it.vtt6.45 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/011 Overview of Spark Submit Command_it.vtt6.45 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/007 Solution - Get Flight Count Per US Airport_it.vtt6.42 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/012 Switch between Python Versions to run Spark Applications or launch Pyspark CLI_it.vtt6.4 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/005 Inserting Data using Stage Table_it.vtt6.33 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/004 Accessing OS Environment Variables_it.vtt6.31 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/016 Managing Single Node Hadoop_it.vtt6.28 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/003 Validating Functions_it.vtt6.27 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/005 Loading Data Into Tables - Local_it.vtt6.24 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/010 Date and Time Manipulation Functions_it.vtt6.22 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/007 Overview of Spark Setup on top of Hadoop_it.vtt6.2 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/016 Run Spark Application and review default executors_it.vtt6.17 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/002 Data Frames for basic transformations_it.vtt6.16 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/003 Creating Tables using Parquet_it.vtt6.1 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/010 Steps to pass external Python Libraries using pyfiles_it.vtt6.07 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/016 Solution - Problem 2 - Get Total Aggregations By FlightDate_it.vtt6.07 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/029 Intergrate Spark 3 with Jupyter Lab_it.vtt6.03 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/012 Truncating Tables_it.vtt2.89 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/013 Setting up Data for Production Validation_it.vtt6.01 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/011 Conclusion - Final Solution_it.vtt6 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/external-assets-links.txt1.6 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/014 Review Spark Configuration Properties at Run Time_it.vtt5.99 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/02 - Setting up Environment using AWS Cloud9/006 Associating Elastic IPs to Cloud9 Instance_it.vtt5.99 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/15 - Apache Spark using Python - Data Processing Overview/003 Understanding airlines data_it.vtt5.98 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/002 Introduction to Partitioning_it.vtt1.91 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/013 Date and Time Extract Functions_it.vtt5.98 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/004 Build the JSON with Application Properties_it.vtt5.92 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/009 Managed Tables vs External Tables_it.vtt5.91 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/008 Overview of submitting spark application using cluster deployment mode_it.vtt5.89 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/007 Overview of submitting spark application using client deployment mode_it.vtt5.84 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/005 Common String Manipulation Functions_it.vtt5.82 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/012 Run Spark Application on Gateway Node using PuTTY_it.vtt5.74 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/004 Load vs. Insert_it.vtt5.73 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/003 Launching and using Spark SQL CLI_it.vtt5.72 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/007 Data Type Conversion_it.vtt5.64 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/004 Setup Password less login_it.vtt5.59 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/009 Overview of Subqueries_it.vtt5.53 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/011 Drop Tables and Databases_it.vtt5.48 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/011 Solution - Get Count of Flights without master data_it.vtt5.46 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/002 Listing HDFS Files_it.vtt5.45 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/001 Getting help or usage_it.vtt5.45 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/006 Understanding Warehouse Directory_it.vtt5.42 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/013 Solution - Get Daily Revenue_it.vtt5.41 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/external-assets-links.txt1.42 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/007 Adding Partitions to Tables_it.vtt5.4 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/001 Introduction - Overview of Spark SQL Functions_it.vtt2.48 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/03 - Setting up Environment - Overview of GCP and Provision Ubuntu VM/006 Validating Python_it.vtt5.38 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/005 Projecting Data_it.vtt5.31 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/02 - Setting up Environment using AWS Cloud9/003 Warming up with Cloud9 IDE_it.vtt5.16 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/005 Explore APIs to process JSON Data using Pyspark_it.vtt5.05 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/02 - Setting up Environment using AWS Cloud9/007 Increase EBS Volume Size of Cloud9 Instance_it.vtt4.99 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/002 Prepare HR Database_it.vtt4.97 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/013 Download and Install Spark 2_it.vtt4.97 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/003 Define Problem Statement_it.vtt4.93 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/003 Categories of Functions_it.vtt4.9 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/011 Exercise - Partitioned Tables_it.vtt4.89 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/013 Configure Tunnel to Gateway Node using PuTTY on Windows for SSH Proxy_it.vtt4.88 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/002 Overview of Types of Spark Cluster Managers_it.vtt4.85 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/03 - Setting up Environment - Overview of GCP and Provision Ubuntu VM/001 Signing up for GCP_it.vtt4.79 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/001 Setup SSH Proxy to access Spark Application logs - Introduction_it.vtt4.78 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/001 Basic Transformations using Spark SQL - Introduction_it.vtt4.76 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/02 - Setting up Environment using AWS Cloud9/005 Opening ports for Cloud9 Instance_it.vtt4.71 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/001 Introduction_it.vtt4.71 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/003 Overview of Windowing Functions_it.vtt4.6 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/external-assets-links.txt1.23 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/009 Inserting Data into Partitions_it.vtt4.54 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/001 Introduction to Windowing Functions_it.vtt2.85 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/018 Overview of performing tasks related to Spark Applications using Mac_it.vtt4.53 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/008 Creating External Tables_it.vtt4.51 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/005 Running OS Commands using Spark SQL_it.vtt4.46 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/02 - Setting up Environment using AWS Cloud9/001 Getting Started with Cloud9_it.vtt4.45 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/001 Introduction to Single Node Hadoop Cluster_it.vtt4.43 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/006 Creating Partitioned Tables_it.vtt4.42 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/008 Managing Spark Metastore Tables_it.vtt4.31 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/06 - Python Fundamentals/006 Setting up Data Sets for Basic IO Operations_it.vtt4.3 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/002 Exploring Spark Catalog_it.vtt4.09 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/008 Solution - Get Flight Count Per US State_it.vtt4.07 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/004 Setup sshuttle on Mac or Linux_it.vtt4.02 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/001 Introduction_it.vtt3.95 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/002 Overview of Functions_it.vtt3.94 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/012 Solution - Get Count of Flights per Airport without master data_it.vtt3.87 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/001 Deploying and Monitoring Spark Applications - Introduction_it.vtt3.8 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/006 Refactor the Spark Application Code to use properties from JSON_it.vtt3.78 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/005 Getting Started with Spark_it.vtt3.75 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/002 Overview of Spark Documentation_it.vtt3.59 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/external-assets-links.txt65 B
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/003 Setup Working Directory to pass application properties using JSON_it.vtt3.57 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/007 Setup Sample Data_it.vtt3.54 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/001 Overview of Basic Transformations_it.vtt3.42 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/007 Loading Data - Append and Overwrite_it.vtt3.41 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/001 Introduction_it.vtt3.31 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/003 Download and Install Hive_it.vtt3.23 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/010 Getting HDFS Storage Usage_it.vtt3.22 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/03 - Setting up Environment - Overview of GCP and Provision Ubuntu VM/002 Overview of GCP Web Console_it.vtt3.22 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/004 Adding Comments_it.vtt3.21 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/007 Side effects of using SSH Proxy to access Spark Application Logs_it.vtt3.17 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/009 Retrieve Metadata of Tables_it.vtt3.11 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/014 Refactor the Spark Application Code to use properties from YAML_it.vtt2.98 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/010 Review Deployment Process of Spark Application_it.vtt2.74 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/external-assets-links.txt2.69 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/011 Create required YAML File to externalize application properties_it.vtt2.45 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/external-assets-links.txt2.29 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/001 Deployment Modes of Spark Applications - Introduction_it.vtt2.21 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/15 - Apache Spark using Python - Data Processing Overview/external-assets-links.txt1.08 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/008 Steps to setup SSH Proxy on Windows to access Spark Application Logs_it.vtt2.1 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/004 Overview of Joins_it.vtt1.91 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/external-assets-links.txt1.42 KiB
.pad/2148.46 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/010 Create Data Frames from Text Files.mp4126.95 MiB
.pad/353.05 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/008 Develop Simple Application.mp4110.79 MiB
.pad/4216.49 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/013 Submit Spark Applications using SPARK_CONF_DIR with custom properties files.mp499.99 MiB
.pad/58.69 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/005 Quick Review of Spark APIs.mp497.3 MiB
.pad/6714.25 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/004 Copying files from local to HDFS.mp489.32 MiB
.pad/7692.73 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/003 Managing HDFS Directories.mp488.23 MiB
.pad/8785.37 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/005 Date Manipulation Functions.mp486.53 MiB
.pad/9481.11 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/010 Changing Spark Application Log Level using custom log4j properties.mp486.44 MiB
.pad/10573.55 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/009 Aggregating Data.mp485.63 MiB
.pad/11375.22 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/008 Setup Data Set for Word Count application.mp484.42 MiB
.pad/12593.14 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/009 Pass Application Properties to Spark Application using HDFS files.mp483.57 MiB
.pad/13441.92 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/002 Default Execution Master Type for Spark Applications.mp481.33 MiB
.pad/14688.31 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/014 Running the application using YARN.mp479.92 MiB
.pad/1579.2 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/06 - Python Fundamentals/005 Map Reduce operations on Python Collections.mp479.9 MiB
.pad/16104.58 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/004 Apache Spark Official Documentation (Very Important).mp479.54 MiB
.pad/17472.78 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/005 Using LEAD or LAG.mp477.44 MiB
.pad/18574.57 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/03 - Setting up Environment - Overview of GCP and Provision Ubuntu VM/007 Setup Jupyter Lab.mp474.8 MiB
.pad/19209.44 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/004 Running Spark Applications using Local Mode.mp473.93 MiB
.pad/2067.24 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/007 Ranking using Windowing Functions.mp473.32 MiB
.pad/21700.67 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/014 Overview of Sorting Data Frames.mp472.24 MiB
.pad/22781.13 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/007 Managing Spark Metastore Databases.mp471.99 MiB
.pad/2310.32 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/03 - Setting up Environment - Overview of GCP and Provision Ubuntu VM/005 Setup Docker.mp471.91 MiB
.pad/2496.38 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/006 Accessing Local Files using Spark CLI or Spark Applications.mp470.99 MiB
.pad/2514.22 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/004 Overview of Spark SQL Properties.mp470.86 MiB
.pad/26145.58 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/15 - Apache Spark using Python - Data Processing Overview/002 Overview of Spark Read APIs.mp469.14 MiB
.pad/27880.03 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/003 Launch Pyspark using local mode.mp468.81 MiB
.pad/28191.05 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/012 Submit Spark Application using cluster mode with log level info.mp468.54 MiB
.pad/29474.86 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/004 String Manipulation Functions.mp468.01 MiB
.pad/301010.67 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/002 Create Spark Metastore Tables.mp467.74 MiB
.pad/31269.74 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/015 Detailed Validation of the Application.mp467.73 MiB
.pad/32272.6 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/008 Loading Data into Partitioned Tables.mp466.57 MiB
.pad/33437.46 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/003 Overview of Data Types.mp466.24 MiB
.pad/34779.32 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/006 Getting File Metadata.mp465.19 MiB
.pad/35825.43 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/011 Submit Spark Application using client mode with log level info.mp464.34 MiB
.pad/36673.67 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/019 Validate Jupyter Lab Setup.mp464.27 MiB
.pad/37748.5 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/017 Validate Spark 2 using CLIs.mp463.42 MiB
.pad/38593.63 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/005 Overview of Spark CLI Commands such as Pyspark.mp463.01 MiB
.pad/391009.34 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/010 Total Aggregations.mp462.67 MiB
.pad/40339.02 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/006 Getting first and last values.mp461.52 MiB
.pad/41491.78 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/014 Submit Spark Applications using Properties File.mp460.7 MiB
.pad/42307.74 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/027 Validate Spark 3 using CLIs.mp459.91 MiB
.pad/4392.41 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/009 Develop Word Count Application.mp458.11 MiB
.pad/44911.51 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/004 Aggregations using Windowing Functions.mp457.49 MiB
.pad/45518.8 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/010 Overview of File Formats.mp457.42 MiB
.pad/46595.7 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/15 - Apache Spark using Python - Data Processing Overview/008 Overview of Spark Write APIs.mp457.13 MiB
.pad/47891.09 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/025 Configure Spark 3.mp456.65 MiB
.pad/48359.03 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/011 Exercise - Getting Started with Spark SQL.mp453.81 MiB
.pad/49190.7 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/006 Left or Right Outer Join.mp452.42 MiB
.pad/50593.64 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/002 Review of Setup Steps for Spark Environment.mp452 MiB
.pad/51426 B
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/007 Configure and Setup Hive Metastore.mp451.1 MiB
.pad/52919.87 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/007 Pass Application Properties to Spark Application using local files in client mod.mp451.01 MiB
.pad/531018.42 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/15 - Apache Spark using Python - Data Processing Overview/004 Inferring Schema.mp450.73 MiB
.pad/54280.91 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/005 Boolean Operators.mp450.72 MiB
.pad/55286.89 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/008 Pass Application Properties to Spark Application using local files in cluster mo.mp450.49 MiB
.pad/56527.03 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/03 - Setting up Environment - Overview of GCP and Provision Ubuntu VM/004 Provision Ubuntu VM from GCP.mp450.31 MiB
.pad/57706.78 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/009 HDFS Replication Factor.mp450.02 MiB
.pad/581006.82 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/012 HDFS File Permissions.mp449.7 MiB
.pad/59306.88 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/003 Creating Metastore Tables using catalog.mp449.68 MiB
.pad/60328.69 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/016 Update Hosts file for worker nodes using private ips.mp448.3 MiB
.pad/61719.19 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/002 Analyze Datasets for Joins.mp447.73 MiB
.pad/62281.08 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/015 Configure Spark 2.mp447.32 MiB
.pad/63697.79 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/002 Steps to pass application properties using JSON.mp446.86 MiB
.pad/64139.3 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/06 - Python Fundamentals/002 Basic Programming Constructs.mp446.82 MiB
.pad/65187.13 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/008 Create Partitioned Tables.mp446.52 MiB
.pad/66493.6 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/006 Loading Data Into Tables - HDFS.mp445.37 MiB
.pad/67650.16 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/009 Using CASE and WHEN.mp445.29 MiB
.pad/68722.99 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/008 Configure Hadoop HDFS.mp444.93 MiB
.pad/6971.65 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/003 Overview of SSH Proxy.mp444.93 MiB
.pad/7074.26 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/008 Joining Tables - Outer.mp442.76 MiB
.pad/71240.81 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/006 Extracting Strings using substring.mp442.59 MiB
.pad/72423.23 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/005 Define Schema for Tables using StructType.mp442.15 MiB
.pad/73871.88 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/010 Query Example - Word Count.mp441.49 MiB
.pad/74517.26 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/15 - Apache Spark using Python - Data Processing Overview/005 Previewing Airlines Data.mp441.3 MiB
.pad/75719.22 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/003 Using ITVersity labs.mp441.29 MiB
.pad/76726.68 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/011 Create Data Frames from Hive Tables.mp441.16 MiB
.pad/77858.22 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/009 Process data using Spark APIs.mp441 MiB
.pad/781020.63 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/002 Listing HDFS Files.mp440.89 MiB
.pad/79109.23 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/006 Spark Modules.mp440.59 MiB
.pad/80421.83 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/015 Solution - Problem 1 - Get Total Aggregations.mp440.47 MiB
.pad/81543.22 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/013 Setting up Data for Production Validation.mp440.13 MiB
.pad/82891.08 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/013 Overriding Properties.mp440.12 MiB
.pad/83896.96 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/010 Start and Validate HDFS.mp439.85 MiB
.pad/84157.23 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/008 Dealing with Nulls.mp439.56 MiB
.pad/85450.89 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/005 Using Inner Joins.mp439.53 MiB
.pad/86480.44 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/008 HDFS Block Size.mp439.13 MiB
.pad/87888.88 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/02 - Setting up Environment using AWS Cloud9/008 Setup Jupyter Lab on Cloud9.mp439.1 MiB
.pad/88916.7 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/013 Switch between Pyspark Versions to run Spark Applications or launch Pyspark CLI.mp438.78 MiB
.pad/89228.03 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/005 Copying files from HDFS to local.mp438.74 MiB
.pad/90269.78 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/006 Filtering Data.mp438.72 MiB
.pad/91290.27 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/015 Pass External Dependencies to Spark Application using local files in client mode.mp438.71 MiB
.pad/92298.66 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/005 Setup Database for Hive Metastore.mp438.25 MiB
.pad/93773.09 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/007 Extracting Strings using split.mp437.84 MiB
.pad/94160.33 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/012 Using Date and Time Trunc Functions.mp437.81 MiB
.pad/95199.49 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/15 - Apache Spark using Python - Data Processing Overview/006 Overview of Data Frame APIs.mp437.49 MiB
.pad/96526.38 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/016 Pass External Dependencies to Spark Apps using local files in cluster mode.mp437.48 MiB
.pad/97537.2 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/15 - Apache Spark using Python - Data Processing Overview/001 Starting Spark Context - pyspark.mp437.46 MiB
.pad/98556.4 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/002 Overview of Private and Public ips of servers in the cluster.mp437.14 MiB
.pad/99876.68 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/017 Pass External Dependencies to Spark Application using HDFS files.mp436.35 MiB
.pad/100670.28 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/008 Read data from files.mp436.01 MiB
.pad/1011012.98 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/006 Inserting into Existing Tables.mp435.55 MiB
.pad/102465.77 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/008 Order of execution of SQL.mp434.98 MiB
.pad/10324.36 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/009 Launch and Validate Hive.mp434.28 MiB
.pad/104741.51 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/06 - Python Fundamentals/004 Python Collections.mp434.27 MiB
.pad/105747.27 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/004 Special Functions - col and lit.mp433.97 MiB
.pad/10625.71 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/015 Understanding Proxying Network Traffic overcoming Windows Caveats.mp433.8 MiB
.pad/107202.4 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/001 Getting help or usage.mp433.24 MiB
.pad/108774.43 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/03 - Setting up Environment - Overview of GCP and Provision Ubuntu VM/006 Validating Python.mp432.86 MiB
.pad/109145.63 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/03 - Setting up Environment - Overview of GCP and Provision Ubuntu VM/003 Overview of GCP Pricing.mp432.3 MiB
.pad/110718.43 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/017 Dealing with Nulls.mp431.3 MiB
.pad/111715.01 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/014 Using to_date and to_timestamp.mp430.53 MiB
.pad/112482.42 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/017 Overview of Spark History Server UI.mp430.43 MiB
.pad/113585.74 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/007 Joining Tables - Inner.mp430.01 MiB
.pad/1141013.51 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/011 Overview of Spark Submit Command.mp429.91 MiB
.pad/11590.54 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/003 Validating Functions.mp429.8 MiB
.pad/116204.98 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/014 Review Spark Configuration Properties at Run Time.mp429.76 MiB
.pad/117241.03 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/002 Setup Prerequisties.mp429.69 MiB
.pad/118316.71 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/013 Exercise - Managed Tables.mp429.62 MiB
.pad/119389.21 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/006 Overview of Numeric Functions.mp429.35 MiB
.pad/120666.84 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/15 - Apache Spark using Python - Data Processing Overview/36340624-airtrafficdata.zip29.33 MiB
.pad/121690.15 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/006 Download and Install Hadoop.mp428.98 MiB
.pad/12221.19 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/009 Managed Tables vs External Tables.mp428.91 MiB
.pad/12389.82 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/011 Using Spark SQL.mp428.61 MiB
.pad/124398.33 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/008 Padding Characters around Strings.mp428.29 MiB
.pad/125727.24 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/006 Using IN Operator or isin Function.mp428.13 MiB
.pad/126890.95 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/005 Inserting Data using Stage Table.mp428.13 MiB
.pad/127891.98 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/011 Aggregate data using groupBy.mp427.92 MiB
.pad/12883.49 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/006 Overview of Hadoop HDFS and YARN Setup on Multi-node Cluster.mp427.85 MiB
.pad/129155.79 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/017 Access Spark Application logs using SSH Proxy.mp427.82 MiB
.pad/130187.46 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/013 Explore APIs to process YAML Data using Pyspark.mp427.78 MiB
.pad/131222.53 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/004 Prepare Tables.mp427.66 MiB
.pad/132343.54 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/014 Setup Proxy on Windows and validate using Microsoft Edge browser.mp427.66 MiB
.pad/133346.45 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/010 Role of Spark Metastore or Hive Metastore.mp427.39 MiB
.pad/134624.73 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/018 Using CASE and WHEN.mp427.3 MiB
.pad/135721.7 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/06 - Python Fundamentals/001 Introduction and Setting up Python.mp427.25 MiB
.pad/136767.54 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/005 Understanding YARN Capacity of an Enterprise Cluster.mp427.19 MiB
.pad/137830.13 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/010 Using Dynamic Partition Mode.mp427.16 MiB
.pad/138857.34 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/004 Filtering Example using dates.mp427.09 MiB
.pad/139932.36 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/02 - Setting up Environment using AWS Cloud9/006 Associating Elastic IPs to Cloud9 Instance.mp427.06 MiB
.pad/140958.28 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/06 - Python Fundamentals/003 Functions in Python.mp426.45 MiB
.pad/141558.34 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/009 Saving as Partitioned Table.mp426.32 MiB
.pad/142695.19 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/007 Overview of Spark Setup on top of Hadoop.mp426.14 MiB
.pad/143881.21 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/010 Sorting Data.mp425.94 MiB
.pad/14459.53 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/002 Getting Started with Pycharm.mp425.93 MiB
.pad/14571.01 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/004 Inferring Schema for Tables.mp425.89 MiB
.pad/146113.26 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/002 Data Frames for basic transformations.mp425.71 MiB
.pad/147301.55 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/15 - Apache Spark using Python - Data Processing Overview/007 Overview of Functions.mp425.54 MiB
.pad/148468.06 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/001 Setup Virtual Environment and Install Pyspark.mp425.45 MiB
.pad/149558.56 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/005 Proxy using sshuttle on Mac or Linux.mp425.23 MiB
.pad/150791.47 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/005 Running OS Commands using Spark SQL.mp425.22 MiB
.pad/151794.31 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/010 Creating Temporary Views.mp425.22 MiB
.pad/152801.69 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/008 Using BETWEEN Operator.mp424.97 MiB
.pad/15332.17 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/001 Overview of Predefined Functions in Spark.mp424.85 MiB
.pad/154158.33 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/011 Validating Writing Data to Files.mp424.73 MiB
.pad/155275.78 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/016 Solution - Problem 2 - Get Total Aggregations By FlightDate.mp424.49 MiB
.pad/156525.96 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/001 Overview of Spark Metastore.mp424.44 MiB
.pad/157569.66 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/006 Create Function for Spark Session.mp424.02 MiB
.pad/1581004.26 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/013 Aggregate data using cube.cmproj.mp423.87 MiB
.pad/159130.22 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/005 Projecting Data.mp423.55 MiB
.pad/160457.31 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/006 Understanding Warehouse Directory.mp423.38 MiB
.pad/161632.66 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/011 Configure Passwordless Login using PuTTYGen Keys on Windows.mp423.28 MiB
.pad/162740.66 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/003 Basic Filtering of Data.mp423.27 MiB
.pad/163752.33 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/016 Dealing with Unix Timestamp.mp422.96 MiB
.pad/16443.2 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/002 Spark SQL - Overview.mp422.56 MiB
.pad/165453.47 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/009 Review the default logging while submitting Spark Applications.mp422.35 MiB
.pad/166661.47 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/001 Passing Application Properties Files and External Dependencies - Introduction.mp422.27 MiB
.pad/167748.4 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/012 Run Spark Application on Gateway Node using PuTTY.mp422.13 MiB
.pad/168886.83 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/029 Intergrate Spark 3 with Jupyter Lab.mp421.9 MiB
.pad/169106.4 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/006 Accessing Spark Application logs via SSH Proxy using sshuttle on Mac or Linux.mp421.78 MiB
.pad/170228.55 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/003 Launching and using Spark SQL CLI.mp421.59 MiB
.pad/171417.72 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/021 Integrate Spark 2 with Jupyter Lab.mp421.59 MiB
.pad/172418.9 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/007 Overview of submitting spark application using client deployment mode.mp421.44 MiB
.pad/173576.48 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/010 Quick Tour of PuTTY on Windows.mp421.05 MiB
.pad/174969.03 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/011 Date and Time Arithmetic.mp420.94 MiB
.pad/17557.61 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/015 Develop Shell Script to run Spark Application.mp420.72 MiB
.pad/176282.59 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/001 Setup Data Sets for Practice.mp420.55 MiB
.pad/177459.63 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/001 Prepare Datasets for Joins.mp420.44 MiB
.pad/178568.42 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/012 Aggregate data using rollup.mp420.25 MiB
.pad/179766.1 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/010 Write data to files.mp420.22 MiB
.pad/180803.39 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/002 Prepare HR Database.mp420.17 MiB
.pad/181852.54 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/015 Using date_format Function.mp419.97 MiB
.pad/18234.09 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/009 Dealing with Nulls while Filtering.mp419.62 MiB
.pad/183387.07 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/01 - Introduction/001 CCA 175 Spark and Hadoop Developer - Curriculum.mp419.38 MiB
.pad/184635.9 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/016 Run Spark Application and review default executors.mp419.35 MiB
.pad/185669.81 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/007 Previewing Data in HDFS File.mp418.55 MiB
.pad/186465.9 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/007 Read and Process data from Metastore Tables.mp418.51 MiB
.pad/187505.18 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/007 Loading Data - Append and Overwrite.mp417.97 MiB
.pad/18829.38 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/004 Overall Capacity of Big Data Cluster with Hadoop and Spark.mp417.93 MiB
.pad/18974.56 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/006 Creating Partitioned Tables.mp417.84 MiB
.pad/190159.45 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/016 Managing Single Node Hadoop.mp417.41 MiB
.pad/191605.08 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/003 Setup EMR Cluster with Hadoop and Spark.mp417.06 MiB
.pad/192966.74 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/002 Overview of Types of Spark Cluster Managers.mp416.96 MiB
.pad/19340.51 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/009 Trimming Characters from Strings.mp416.88 MiB
.pad/194122.25 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/011 Scripts to Manage Single Node Cluster.mp416.83 MiB
.pad/195177.64 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/003 Passing Run Time Arguments.mp416.73 MiB
.pad/196279.18 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/010 Filtering Window Function Results.mp416.7 MiB
.pad/197305.12 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/007 Using LIKE Operator or like Function.mp416.48 MiB
.pad/198532.42 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/004 Build the JSON with Application Properties.mp416.4 MiB
.pad/199617.66 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/02 - Setting up Environment using AWS Cloud9/002 Creating Cloud9 Environment.mp416.32 MiB
.pad/200694.56 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/011 Drop Tables and Databases.mp416.15 MiB
.pad/201873.92 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/06 - Python Fundamentals/006 Setting up Data Sets for Basic IO Operations.mp416.05 MiB
.pad/202968.47 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/004 Setup sshuttle on Mac or Linux.mp415.76 MiB
.pad/203242.48 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/005 Loading Data Into Tables - Local.mp415.71 MiB
.pad/204300.25 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/003 Creating Tables using Parquet.mp415.55 MiB
.pad/205460.82 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/005 Explore APIs to process JSON Data using Pyspark.mp415.3 MiB
.pad/206713.51 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/014 Solution - Get Daily Revenue rolled up till Yearly.mp415.25 MiB
.pad/207767.5 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/009 Retrieve Metadata of Tables.mp415.25 MiB
.pad/208772.28 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/002 Overview of Spark Documentation.mp415.19 MiB
.pad/209825.08 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/012 Switch between Python Versions to run Spark Applications or launch Pyspark CLI.mp415.12 MiB
.pad/210898.11 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/012 Install PyYAML into specific folder and build zip.mp415.05 MiB
.pad/211975.67 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/008 Solution - Get Flight Count Per US State.mp414.94 MiB
.pad/21262.15 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/013 Date and Time Extract Functions.mp414.88 MiB
.pad/213118.39 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/005 Getting Started with Spark.mp414.82 MiB
.pad/214188.02 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/15 - Apache Spark using Python - Data Processing Overview/003 Understanding airlines data.mp414.75 MiB
.pad/215251.06 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/010 Solution - Get Origins without master data.mp414.73 MiB
.pad/216279.03 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/02 - Setting up Environment using AWS Cloud9/007 Increase EBS Volume Size of Cloud9 Instance.mp414.49 MiB
.pad/217524.06 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/008 Overview of submitting spark application using cluster deployment mode.mp414.35 MiB
.pad/218661.82 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/004 Load vs. Insert.mp414.34 MiB
.pad/219675.48 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/010 Getting HDFS Storage Usage.mp414.13 MiB
.pad/220893.51 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/007 Adding Partitions to Tables.mp413.74 MiB
.pad/221264.92 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/011 Conclusion - Final Solution.mp413.68 MiB
.pad/222331.07 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/001 Introduction.mp413.66 MiB
.pad/223345.26 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/007 Setup Sample Data.mp413.63 MiB
.pad/224378.12 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/007 Solution - Get Flight Count Per US Airport.mp413.55 MiB
.pad/225464.66 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/005 Common String Manipulation Functions.mp413.27 MiB
.pad/226746.69 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/008 Creating External Tables.mp413.2 MiB
.pad/227820.59 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/011 Solution - Get Count of Flights without master data.mp413.07 MiB
.pad/228953.34 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/013 Solution - Get Daily Revenue.mp413.06 MiB
.pad/229958.86 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/004 Adding Comments.mp412.93 MiB
.pad/23070.12 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/011 Exercise - Partitioned Tables.mp412.87 MiB
.pad/231132.15 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/009 Solution - Get Dormant US Airports.mp412.81 MiB
.pad/232196.01 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/002 Create Dummy Data Frame.mp412.51 MiB
.pad/233496.73 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/007 Data Type Conversion.mp412.35 MiB
.pad/234668.44 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/012 Productionizing the Code.mp412 MiB
.pad/2351022.99 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/009 Overview of Subqueries.mp411.99 MiB
.pad/23610.1 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/013 Download and Install Spark 2.mp411.89 MiB
.pad/237110.07 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/009 Inserting Data into Partitions.mp411.53 MiB
.pad/238481.2 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/010 Date and Time Manipulation Functions.mp411.42 MiB
.pad/239589.53 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/03 - Setting up Environment - Overview of GCP and Provision Ubuntu VM/001 Signing up for GCP.mp411.41 MiB
.pad/240600.09 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/001 Introduction.mp411.23 MiB
.pad/241787 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/010 Steps to pass external Python Libraries using pyfiles.mp411.18 MiB
.pad/242840.36 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/001 Setup SSH Proxy to access Spark Application logs - Introduction.mp411.06 MiB
.pad/243960.72 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/013 Configure Tunnel to Gateway Node using PuTTY on Windows for SSH Proxy.mp411.04 MiB
.pad/244981.93 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/20 - Apache Spark - Application Development Life Cycle/004 Accessing OS Environment Variables.mp410.83 MiB
.pad/245171.58 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/02 - Setting up Environment using AWS Cloud9/001 Getting Started with Cloud9.mp410.79 MiB
.pad/246210.39 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/006 Refactor the Spark Application Code to use properties from JSON.mp410.64 MiB
.pad/247369.98 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/003 Overview of Windowing Functions.mp410.6 MiB
.pad/248407.09 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/02 - Setting up Environment using AWS Cloud9/005 Opening ports for Cloud9 Instance.mp410.56 MiB
.pad/249446.88 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/19 - Apache Spark using Python - Spark Metastore/002 Exploring Spark Catalog.mp410.46 MiB
.pad/250556.21 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/018 Overview of performing tasks related to Spark Applications using Mac.mp410.07 MiB
.pad/251954.48 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/11 - Apache Spark using SQL - Basic DDL and DML/012 Truncating Tables.mp49.03 MiB
.pad/252996.04 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/002 Overview of Functions.mp49.03 MiB
.pad/253998 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/003 Define Problem Statement.mp48.76 MiB
.pad/254247.06 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/001 Introduction to Single Node Hadoop Cluster.mp48.69 MiB
.pad/255313.51 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/012 Solution - Get Count of Flights per Airport without master data.mp48.65 MiB
.pad/256360.77 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/16 - Apache Spark using Python - Processing Column Data/003 Categories of Functions.mp48.25 MiB
.pad/257767.48 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/008 Managing Spark Metastore Tables.mp48.17 MiB
.pad/258853.38 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/10 - Apache Spark using SQL - Basic Transformations using Spark SQL/001 Basic Transformations using Spark SQL - Introduction.mp48.04 MiB
.pad/259985.32 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/001 Deploying and Monitoring Spark Applications - Introduction.mp47.9 MiB
.pad/26098.36 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/007 Side effects of using SSH Proxy to access Spark Application Logs.mp47.87 MiB
.pad/261136.1 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/004 Setup Password less login.mp47.7 MiB
.pad/262304.46 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/02 - Setting up Environment using AWS Cloud9/003 Warming up with Cloud9 IDE.mp47.5 MiB
.pad/263510.27 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/003 Setup Working Directory to pass application properties using JSON.mp47.06 MiB
.pad/264963.23 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/17 - Apache Spark using Python - Basic Transformations/001 Overview of Basic Transformations.mp46.87 MiB
.pad/265130.07 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/03 - Setting up Environment - Overview of GCP and Provision Ubuntu VM/002 Overview of GCP Web Console.mp46.76 MiB
.pad/266250.09 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/023 Download and Install Spark 3.mp46.72 MiB
.pad/267283.29 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/05 - Setup Hive and Spark on Single Node Cluster/003 Download and Install Hive.mp46.62 MiB
.pad/268387.58 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/08 - Apache Spark 2.x - Data processing - Getting Started/001 Introduction.mp45.76 MiB
.pad/269241.91 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/008 Steps to setup SSH Proxy on Windows to access Spark Application Logs.mp45.74 MiB
.pad/270267.81 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/07 - Overview of Hadoop HDFS Commands/011 Using HDFS Stat Commands.mp45.64 MiB
.pad/271369.28 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/23 - Deployment Modes of Spark Applications/001 Deployment Modes of Spark Applications - Introduction.mp45.55 MiB
.pad/272465 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/014 Start and Validate YARN.mp45.19 MiB
.pad/273826.72 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/02 - Setting up Environment using AWS Cloud9/004 Overview of EC2 related to Cloud9.mp45.03 MiB
.pad/274993.22 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/14 - Apache Spark SQL - Windowing Functions/001 Introduction to Windowing Functions.mp44.98 MiB
.pad/27517.76 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/09 - Apache Spark using SQL - Getting Started/001 Getting Started - Overview.mp44.97 MiB
.pad/27629.81 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/13 - Apache Spark using SQL - Pre-defined Functions/001 Introduction - Overview of Spark SQL Functions.mp44.61 MiB
.pad/277395.01 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/014 Refactor the Spark Application Code to use properties from YAML.mp44.42 MiB
.pad/278592.24 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/003 Problem Statements for Joins.mp43.97 MiB
.pad/27926.45 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/24 - Passing Application Properties Files and External Dependencies/011 Create required YAML File to externalize application properties.mp43.76 MiB
.pad/280248.63 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/12 - Apache Spark using SQL - DML and Partitioning/002 Introduction to Partitioning.mp43.68 MiB
.pad/281326.09 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/04 - Setup Hadoop on Single Node Cluster/012 Configure Hadoop YARN.mp43.63 MiB
.pad/282375.92 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/18 - Apache Spark using Python - Joining Data Sets/004 Overview of Joins.mp42.93 MiB
.pad/28376.35 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/21 - Spark Application Execution Life Cycle and Spark UI/010 Review Deployment Process of Spark Application.mp42.5 MiB
.pad/284515.54 KiB
[TutsNode.com] - Apache Spark 2 and 3 using Python 3 (Formerly CCA 175)/22 - Setup SSH Proxy to access Spark Application logs/009 Setup PuTTY and PuTTYgen on Windows.mp42.02 MiB