
- #Docker factory reset now failed to initialize install#
- #Docker factory reset now failed to initialize full#
At this point, we would see a dialog like the following:Ĭlick on ` StreamSets DataCollector was started from Docker` option highlighted above and copy the command as shown and click on cancel. In the web UI, go ahead and select a library and then click the “Install” icon (see documentation for details).
#Docker factory reset now failed to initialize install#
One common gotcha with running StreamSets in Docker happens when we need to install additional stage libraries. Once we are inside, we can run whatever commands we need and, when we’re done, can use exit (or CTRL+D) to come back to the host: $ docker exec -it sdc bash One simple way is to run the following command to start a Bash session inside the container: $ docker exec -it sdc bash Exploring the Docker ContainerĪfter we have created the Docker container, we might want to take a look around (e.g. Here are a few tricks I learned along the way which helped me a lot. For someone new, a great place to start would be our tutorials on GitHub that contain step-by-step guides for everything from creating and running a data pipeline to more advanced operations like data manipulation. Now that we have access to the web UI, we can start playing with all the cool capabilities that SDC has to offer. This will present a prompt to log in so sign in with your preferred option and then follow the instructions on the screen to link your Data Collector with your StreamSets account. We can verify that the service has started by using a web browser pointed to `localhost:18630` This is the host port to which Docker has published the Data Collector container’s port 18630. Voila! We have successfully created a Docker container with Data Collector. If all goes well, running docker ps will show output like the following: $ docker psĬONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMESįd863929e26c streamsets/datacollector "." 12 minutes ago Up 12 minutes 0.0.0.0:18630->18630/tcp sdc Restart only if the container exits with a non-zero exit status Publish container’s port 18630 on host’s 18630
#Docker factory reset now failed to initialize full#
Here are the options we specified (for a full list, check out the image notes on Docker Hub): -dĬreate Docker container in the background in detached mode $ docker run -restart on-failure -p 18630:18630 -d -name sdc streamsets/datacollector To start a Docker container with the most recent release of StreamSets Data Collector just run the following command. As it turns out, the Docker images we publish for SDC were the easiest way to explore its vast set of features and capabilities, which is exactly why I am writing this blog post. ‘ Simplicity is the ultimate sophistication.’Īs a recent hire on the Engineering Productivity team here at StreamSets, my early days at the company were marked by efforts to dive head-first into StreamSets Data Collector (SDC), a fast data ingestion engine, to build data pipelines.
