Bump sbt/setup-sbt from 1.1.22 to 1.2.1 (#261)
Bumps sbt/setup-sbt from 1.1.22 to 1.2.1.
updated-dependencies:
- dependency-name: sbt/setup-sbt dependency-version: 1.2.1 dependency-type: direct:production update-type: version-update:semver-minor …
Signed-off-by: dependabot[bot] support@github.com Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
版权所有:中国计算机学会技术支持:开源发展技术委员会
京ICP备13000930号-9
京公网安备 11010802047560号
Apache Toree
Apache Toree is a Juypter Notebook kernel. The main goal of Toree is to provide the foundation for interactive applications that connect to and use Apache Spark using Scala language.
Overview
Toree provides an interface that allows clients to interact with a Spark Cluster. Clients can send libraries and snippets of code that are interpreted and executed using a preconfigured Spark context. These snippets can do a variety of things:
Apache Toree supports the
Scalaprogramming language. It implements the latest Jupyter message protocol (5.0), so it can easily plug into the latest releases of Jupyter/IPython (3.2.x+ and up) for quick, interactive data exploration.Develop
This project uses
makeas the entry point for build, test, and packaging. To perform a local build, you need to installsbt,jupyter/ipython, and other development requirements locally on your machine.To build and interact with Toree using Jupyter, run
This will start a Jupyter notebook server. Depending on your mode, it will be accessible at
http://localhost:8888orhttp://192.168.44.44:8888. From here you can create notebooks that use Toree configured for Spark local mode.Tests can be run by doing
make test.Build & Package
To build and package up Toree, run
This results in 2 packages.
./dist/toree-<VERSION>-binary-release.tar.gzis a simple package that contains JAR and executable./dist/toree-<VERSION>.tar.gzis apipinstallable package that adds Toree as a Jupyter kernel.NOTE:
make releaseusesdocker. Please refer todockerinstallation instructions for your system.Building Individual Components
Main Toree Assembly
To build just the main Toree assembly jar (without spark-monitor-plugin):
This creates:
target/scala-2.12/toree-assembly-<VERSION>.jarSpark Monitor Plugin
To build the spark-monitor-plugin as a separate jar:
This creates:
spark-monitor-plugin/target/scala-2.12/spark-monitor-plugin-<VERSION>.jarBuild All Components
To compile all projects including both the main assembly and spark-monitor-plugin:
Note: The spark-monitor-plugin is now built as a separate jar and is not included in the main Toree assembly.
Using the Spark Monitor Plugin
To enable the Spark Monitor Plugin in your Toree application, you need to specify the path to the plugin JAR when starting Toree:
Option 1: Command Line Parameter
Option 2: Jupyter Kernel Installation
When installing Toree as a Jupyter kernel, you can specify the plugin:
Option 3: Configuration File
You can also specify the plugin in a configuration file and use the
--profileoption:Then start with:
java -jar toree-assembly.jar --profile config.jsonImportant:
--magic-urlparameter.Run Examples
To play with the example notebooks, run
A notebook server will be launched in a
Dockercontainer with Toree and some other dependencies installed. Refer to yourDockersetup for the ip address. The notebook will be athttp://<ip>:8888/.Install
This requires you to have a distribution of Apache Spark downloaded to the system where Apache Toree will run. The following commands will install Apache Toree.
Dev snapshots of Toree are located at https://dist.apache.org/repos/dist/dev/incubator/toree. To install using one of those packages, you can use the following:
where
PIP_RELEASE_URLis one of thepippackages. For example:Reporting Issues
Refer to and open issue here
Communication
You can reach us through gitter or our mailing list
Version
We are working on publishing binary releases of Toree soon. As part of our move into Apache Incubator, Toree will start a new version sequence starting at
0.1.Our goal is to keep
masterup to date with the latest version of Spark. When new versions of Spark require specific code changes to Toree, we will branch out older Spark version support.As it stands, we maintain several branches for legacy versions of Spark. The table below shows what is available now.
Please note that for the most part, new features will mainly be added to the
masterbranch.Resources
We are currently enhancing our documentation, which is available in our website.