streamsets/tutorials

tutorial-processor documentation requires updates

Akhi1 opened this issue · 1 comments

Akhi1 commented

Hi @metadaddy, I'm following the "Creating a Custom StreamSets Processor" tutorial:
https://github.com/streamsets/tutorials/tree/master/tutorial-processor

It mentions a pipeline based on the "SDC taxi data tutorial", but the link returns "content not found": https://streamsets.com/documentation/datacollector/latest/help/Tutorial/Overview.html

Can you please check this and update the link?

Also, while I was building the template using

$ mvn clean package -DskipTests

it failed with the message "The POM for com.streamsets:streamsets-datacollector-api:jar:1.3.0.0-SNAPSHOT is missing, no dependency information available"

I then had to manually update streamsets.version to 3.1.0.0 (the latest version) in pom.xml as follows:

  <properties>
    <streamsets.version>3.1.0.0</streamsets.version>
    <slf4j.version>1.7.7</slf4j.version>
    <junit.version>4.12</junit.version>
    <project.build.sourceEncoding>UTF-8</project.build.sourceEncoding>
  </properties>

  <dependencies>
    <dependency>
      <groupId>com.streamsets</groupId>
      <artifactId>streamsets-datacollector-api</artifactId>
      <version>3.1.0.0</version>
      <scope>provided</scope>
    </dependency>
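
The manual edit above could also be scripted. This is a minimal sketch, assuming the generated pom.xml keeps the version in a single `<streamsets.version>` property as shown; the function name `set_streamsets_version` is hypothetical and not part of the tutorial. It only rewrites the property, so a version hard-coded inside a `<dependency>` would still need to be changed by hand.

```shell
#!/bin/sh
# Hypothetical helper (not part of the tutorial): rewrite the
# <streamsets.version> property in a pom.xml to a released version.
set_streamsets_version() {
  pom="$1"      # path to the pom.xml to edit
  version="$2"  # released version to use, e.g. 3.1.0.0
  tmp="$(mktemp)" || return 1
  # Replace whatever is between the property tags with the new version,
  # writing to a temp file first so a failed sed leaves the pom untouched.
  sed "s|<streamsets.version>[^<]*</streamsets.version>|<streamsets.version>${version}</streamsets.version>|" \
    "$pom" > "$tmp" && cat "$tmp" > "$pom"
  status=$?
  rm -f "$tmp"
  return "$status"
}

# Usage (run from the directory containing the generated pom.xml):
#   set_streamsets_version pom.xml 3.1.0.0
#   mvn clean package -DskipTests
```

Writing through a temporary file avoids relying on `sed -i`, whose syntax differs between GNU and BSD sed.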

Akhi1 commented

I think it's also missing instructions on how to import the stage into StreamSets for those running the StreamSets Docker container:

Log in to StreamSets,
go to package manager > external libraries > install external libraries, select the tarball generated in the previous step (the mvn build), upload it, and restart the Docker container.