🙌 Get a free 1-year Subscription
Purchase your second Modal course and we'll set you up with a free 1-year subscription.

Big Data with Spark and Python

Learn the fundamentals of big data with Spark and Python! Process and clean data and build a machine learning pipeline.
When
December 9, 2024 - January 19, 2025
Registration closes on November 27, 2024
Course Tuition
$1,950
Want to take more than one course? Send an email to support@modal.com to buy our 1-year subscription for $3,900.
No Up-front Payment
Modal now offers a deferred direct bill payment option for Booz Allen employees.
Who Is This For?

Data Scientists interested in building a foundational skill set for working with big data.

Any prerequisites?
  • A developed understanding of Python syntax, data structures, Pandas DataFrames, NumPy, and Scikit-Learn.
  • Familiarity with Supervised Machine Learning topics, such as the components of, and process for building, a regression ML model.
  • Familiarity with statistics, especially probability, and with applying these concepts in Python.
  • Linear algebra concepts like matrices and vector spaces.
What will I be able to do after this Course?
  • Connect to Spark, import data as RDDs and DataFrames, and perform basic transformations and actions on RDDs.
  • Use PySpark and Spark SQL to clean, manipulate and analyze DataFrames.
  • Use MLlib to preprocess data, and create, train, and evaluate a machine learning model.
NEED HELP DECIDING?
Book time with a learning expert.

A Typical Week

Monday
Self Study
Kick off a new topic with self-study and online learning
  • Coaches support learners hitting roadblocks
  • Manager check-in to bring learning into company context
Tuesday
Wednesday
Labs
Learning material leads into practice environment & labs
  • Coaches support learners hitting roadblocks
  • Pair programming to bring learning into company context
  • Community allows students to help each other
Thursday
Live Event
Interactive live session hosted by Coaches
  • Community allows students to help each other
  • Community Groups host expert AMAs & guided community discussions
Friday
Projects
Work on a weekly project
  • Community allows students to help each other
  • Group projects
  • Coaches support learners hitting roadblocks
Saturday
Sunday
Work at your own pace
Expert coaching and actionable feedback from Coaches

    Course Schedule

    Live Sessions every
    Sprint 1: Welcome to Spark
    Meet Distributed Discounts, a membership-based, bulk retail company! You’ll help DD format their trove of data in a way that can be used by Spark for analysis, and create both RDDs and DataFrames to perform basic actions and transformations on the data.
    Sprint 2: Data Cleaning and Analysis with PySpark and SQL
    Now that Distributed Discounts' data has been imported, you’ll answer key business questions about their customers' demographics, behavior, and purchases using Spark SQL.
    Sprint 3: Machine Learning with Spark
    Create a machine learning model to predict whether new customers will purchase another item from Distributed Discounts.

    Why Modal?

    Projects & Practice
    Real-world exercises put what you learn into context.
    On-Demand Coach Support
    You are never alone. Coaches are always present and can help you!
    Live Sessions
    Hear from guest speakers and expert instructors through engaging lectures.
    Technical Labs
    Hands-on labs allow you to play with new tools and concepts to build real skills.
    Community of Peers
    You will be part of a learning community where support is abundant.
    Asynchronous Learning
    Self-paced learning is scheduled for each learner, with a dashboard to help you stay on track.


    “I love the quantity & quality of learning materials, the interactivity, the live sessions, the coaches, are invaluable. I can really feel the difference in the level of engagement that Modal has to every participant compared to an ordinary course.”

    - Veselina Stoyanova - Reporting Analyst, EMAG

    Learn more about FlexEd

    We are excited that Modal now offers a deferred direct bill payment option for Booz Allen employees.

    The deferred direct bill payment option enables employees to enroll in learning opportunities with no upfront costs. This payment option will require the employee to sign a Family Educational Rights and Privacy Act (FERPA) agreement with Modal to release grades/completion to Booz Allen to satisfy the FlexEd Program completion requirement.

    Note: Modal may also be used with the FlexEd Program reimbursement payment option. See the full FlexEd Program Policy & FAQs.
    Coming Soon!
    Check back in a few weeks or reach out to support@modal.io if you have questions.