Retrieve the complete content of a specific Apache Spark documentation page by providing its relative path.
Retrieve Spark driver logs from completed notebook executions to debug Python errors and analyze Spark behavior. Use stdout for Python tracebacks or stderr for system logs.
Create and submit Dataproc batch jobs for Spark, PySpark, or Spark SQL workloads on Google Cloud. Configure job parameters, dependencies, and execution environment.
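As a rough illustration of the job parameters and dependencies mentioned above, the sketch below assembles a request for a PySpark batch without sending it. The body field names (`pysparkBatch`, `mainPythonFileUri`, `args`, `jarFileUris`, `runtimeConfig.properties`) follow the Dataproc Batches REST resource. The helper name `build_pyspark_batch_request`, the outer `parent`/`batchId`/`batch` grouping, and the project, region, and bucket values are hypothetical and only for illustration. How the tool itself builds its requests is not stated here.

```python
def build_pyspark_batch_request(project, region, batch_id, main_python_file_uri,
                                args=None, jar_file_uris=None, properties=None):
    """Assemble (but do not send) the pieces of a PySpark batch request."""
    pyspark = {"mainPythonFileUri": main_python_file_uri}
    if args:
        pyspark["args"] = list(args)                   # arguments passed to the driver
    if jar_file_uris:
        pyspark["jarFileUris"] = list(jar_file_uris)   # extra JAR dependencies

    batch = {"pysparkBatch": pyspark}
    if properties:
        # Spark properties such as spark.executor.instances
        batch["runtimeConfig"] = {"properties": dict(properties)}

    return {
        "parent": f"projects/{project}/locations/{region}",
        "batchId": batch_id,
        "batch": batch,
    }


# Hypothetical example values
request = build_pyspark_batch_request(
    project="example-project",
    region="us-central1",
    batch_id="wordcount-001",
    main_python_file_uri="gs://example-bucket/jobs/wordcount.py",
    args=["gs://example-bucket/input/"],
    properties={"spark.executor.instances": "2"},
)
```

Keeping request construction separate from submission makes the configuration easy to inspect or log before any job is created.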
Provides full-text search and retrieval tools for Apache Spark documentation using SQLite FTS5 with BM25 ranking. It enables AI assistants to efficiently search, filter by section, and read specific Spark documentation pages.
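A minimal sketch of FTS5 search with BM25 ranking and a section filter, using Python's built-in `sqlite3` module. It assumes the underlying SQLite build has FTS5 enabled. The table name `docs`, its columns, the helper names, and the sample pages are illustrative assumptions, not the actual schema used by the tool.

```python
import sqlite3


def build_index(pages):
    """Create an in-memory FTS5 index from (path, section, body) tuples."""
    conn = sqlite3.connect(":memory:")
    # path and section are stored but not tokenized; only body is searchable
    conn.execute(
        "CREATE VIRTUAL TABLE docs USING fts5(path UNINDEXED, section UNINDEXED, body)"
    )
    conn.executemany(
        "INSERT INTO docs (path, section, body) VALUES (?, ?, ?)", pages
    )
    return conn


def search(conn, query, section=None, limit=5):
    """Return (path, section, score) rows, best match first."""
    sql = "SELECT path, section, bm25(docs) AS score FROM docs WHERE docs MATCH ?"
    params = [query]
    if section is not None:
        sql += " AND section = ?"
        params.append(section)
    # bm25() returns smaller (more negative) values for better matches,
    # so ascending order puts the most relevant page first
    sql += " ORDER BY score LIMIT ?"
    params.append(limit)
    return conn.execute(sql, params).fetchall()


# Illustrative sample pages
conn = build_index([
    ("sql-performance-tuning.html", "sql",
     "Tune shuffle partitions to control shuffle parallelism in SQL queries."),
    ("rdd-programming-guide.html", "core",
     "Shuffle operations such as reduceByKey repartition data across executors."),
    ("streaming-programming-guide.html", "streaming",
     "Structured Streaming processes data in micro batches."),
])

all_hits = search(conn, "shuffle")
core_hits = search(conn, "shuffle", section="core")
```

Here `all_hits` contains the two pages that mention "shuffle", while `core_hits` is narrowed to the page whose section is `core`.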
Analyzes Apache Spark event logs from S3, HTTP, or local sources, providing performance metrics, resource monitoring, shuffle analysis, and automated optimization recommendations, with results presented as interactive HTML reports.
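To make the kind of shuffle and performance metrics involved more concrete, here is a small sketch that aggregates per-stage task metrics from event log lines. Spark event logs are written as one JSON object per line. The keys used below (`SparkListenerTaskEnd`, `Task Metrics`, `Shuffle Read Metrics`, and so on) are those found in typical Spark event logs, but they can vary by Spark version, so treat them as assumptions. The function name and sample lines are hypothetical, and this is not the tool's own analysis logic.

```python
import json
from collections import defaultdict


def summarize_event_log(lines):
    """Aggregate task count, run time, and shuffle bytes per stage."""
    stages = defaultdict(lambda: {
        "tasks": 0,
        "run_time_ms": 0,
        "shuffle_read_bytes": 0,
        "shuffle_write_bytes": 0,
    })
    for line in lines:
        line = line.strip()
        if not line:
            continue
        event = json.loads(line)
        if event.get("Event") != "SparkListenerTaskEnd":
            continue  # only task-end events carry task metrics
        metrics = event.get("Task Metrics") or {}
        read = metrics.get("Shuffle Read Metrics") or {}
        write = metrics.get("Shuffle Write Metrics") or {}
        stage = stages[event["Stage ID"]]
        stage["tasks"] += 1
        stage["run_time_ms"] += metrics.get("Executor Run Time", 0)
        stage["shuffle_read_bytes"] += (
            read.get("Remote Bytes Read", 0) + read.get("Local Bytes Read", 0)
        )
        stage["shuffle_write_bytes"] += write.get("Shuffle Bytes Written", 0)
    return dict(stages)


# Hypothetical sample log lines
sample_log = [
    json.dumps({"Event": "SparkListenerApplicationStart", "App Name": "demo"}),
    json.dumps({"Event": "SparkListenerTaskEnd", "Stage ID": 0, "Task Metrics": {
        "Executor Run Time": 120,
        "Shuffle Write Metrics": {"Shuffle Bytes Written": 2048}}}),
    json.dumps({"Event": "SparkListenerTaskEnd", "Stage ID": 0, "Task Metrics": {
        "Executor Run Time": 80,
        "Shuffle Write Metrics": {"Shuffle Bytes Written": 1024}}}),
    json.dumps({"Event": "SparkListenerTaskEnd", "Stage ID": 1, "Task Metrics": {
        "Executor Run Time": 50,
        "Shuffle Read Metrics": {"Remote Bytes Read": 2000, "Local Bytes Read": 1072}}}),
]

summary = summarize_event_log(sample_log)
```

A per-stage summary like this is the sort of input from which recommendations could be derived, for example flagging stages with large shuffle volume.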