In order to add a full pipeline, you'll have to follow these steps in this order:
- Data prep
- split your raw data into individual data assets
- split these assets between train and test
- Add dataset
- write an opener following the requirements
- write a text describing the dataset, its structure and content
- write and push a json description of the dataset
- Add the prepared data assets
- Add objective
- write a metrics file following the requirements
- write a text describing the objective and how it's evaluated
- write and push json description of the objective
- Add algo
- write your algorithm script following the requirements
- write a targz archive with Dockerfile and algo script
- write a text describing the algo
- write and push a json description of the algorithm
- Add a traintuple to launch training task
You can see a full example of this in the Titanic example