regex - How do you remove text from example sets before processing the data? -
i using rapidminer 5.3.013. reading excel file thousands of rows of worklogs remedy. want remove texts based upon regex ^[a-z][\w\d/?(# ]+[\w0-9#)]{2}:
use process documents data. far have not figured out how this. write vba, know how can done in rapidminer.
having read excel data, make sure field processed process documents operator set type text. using nominal text operator. inside process documents loop, split data tokens using tokenize operator. use filter tokens operator remove tokens don't want. operator takes regular expression parameter. make sure invert flag set on operator remove tokens don't want rather keep them
Comments
Post a Comment