
Python Chatbot Project – Learn to build your first chatbot using NLTK & Keras

Hey Siri, What’s the meaning of Life?

As per all the evidence, it’s chocolate for you.

As soon as I heard this reply from Siri, I knew I had found the perfect partner to savour my hours of solitude. From silly questions to some pretty serious advice, Siri has always been there for me.

How amazing it is to be able to tell someone anything and everything and not be judged at all. That is a top-class feeling, and it is exactly the beauty of a chatbot.

What is a Chatbot?

A chatbot is an intelligent piece of software that is capable of communicating and performing actions similar to a human. Chatbots are used a lot in customer interaction, marketing on social network sites, and instant messaging with clients. There are two basic types of chatbot models, based on how they are built: retrieval-based and generative-based models.


1. Retrieval-based Chatbots

A retrieval-based chatbot uses predefined input patterns and responses. It then uses some type of heuristic approach to select the appropriate response. This approach is widely used in industry to build goal-oriented chatbots, where we can customize the tone and flow of the bot to give our customers the best experience.
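
To make the idea concrete, here is a minimal sketch of a retrieval-based bot. The patterns and responses are invented for illustration; this is not the learned approach the project builds later:

  import random

  # illustrative only: hard-coded patterns mapped to canned responses
  RESPONSES = {
      "hello": ["Hi there!", "Hello!"],
      "hours": ["We are open 9am-5pm, Monday to Friday."],
  }

  def retrieve_response(message):
      # naive heuristic: answer with the first known pattern found in the message
      for pattern, replies in RESPONSES.items():
          if pattern in message.lower():
              return random.choice(replies)
      return "Sorry, I don't understand."

  print(retrieve_response("Hello, what are your hours?"))  # matches "hello"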

2. Generative-based Chatbots

Generative models are not based on predefined responses.

They are based on sequence-to-sequence (seq2seq) neural networks, the same idea as in machine translation. In machine translation, we translate source text from one language to another; here, we transform an input message into an output response. This approach needs a large amount of data and is based on deep neural networks.

About the Python Project – Chatbot

In this Python project with source code, we are going to build a chatbot using deep learning techniques. The chatbot will be trained on a dataset that contains categories (intents), patterns, and responses. We use a feed-forward neural network to classify which category the user's message belongs to, and then give a random response from that category's list of responses.

Let's create a retrieval-based chatbot using NLTK, Keras, and Python.

The Dataset

The dataset we will be using is ‘intents.json’. This is a JSON file that contains the patterns we need to find and the responses we want to return to the user. The link to the project is available below:

Python Chatbot Project Dataset


Prerequisites

The project requires good knowledge of Python, Keras, and Natural Language Processing (NLTK). Along with them, we will use some helper modules, which you can download using pip:

  pip install tensorflow keras nltk

Note that pickle does not need to be installed separately; it is part of the Python standard library.
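
NLTK's tokenizer and lemmatizer also rely on data packages that are not bundled with the library itself. As a one-time setup step (an addition here; the original article does not show it), you can download them from a Python shell:

  import nltk

  # one-time downloads of the data used by the tokenizer and lemmatizer
  nltk.download('punkt')    # required by nltk.word_tokenize()
  nltk.download('wordnet')  # required by WordNetLemmatizer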

 

How to Make a Chatbot in Python?

Now we are going to build the chatbot using Python, but first, let us see the file structure and the types of files we will be creating:


  • intents.json – The data file which has the predefined patterns and responses.
  • train_chatbot.py – In this Python file, we write a script to build the model and train our chatbot.
  • words.pkl – This is a pickle file in which we store the words Python object that contains a list of our vocabulary.
  • classes.pkl – The classes pickle file contains the list of categories.
  • chatbot_model.h5 – This is the trained model that contains information about the model and has the weights of the neurons.
  • chatgui.py – This is the Python script in which we implement the GUI for our chatbot. Users can easily interact with the bot.

Here are the 5 steps to create a chatbot in Python from scratch:

  1. Import and load the data file
  2. Preprocess data
  3. Create training and testing data
  4. Build the model
  5. Predict the response

1. Import and load the data file

First, create a file named train_chatbot.py. We import the necessary packages for our chatbot and initialize the variables we will use in our Python project.

The data file is in JSON format, so we use the json package to parse the JSON file into Python.

  import nltk
  from nltk.stem import WordNetLemmatizer
  lemmatizer = WordNetLemmatizer()
  import json
  import pickle

  import numpy as np
  from keras.models import Sequential
  from keras.layers import Dense, Activation, Dropout
  from keras.optimizers import SGD
  import random

  words = []
  classes = []
  documents = []
  ignore_words = ['?', '!']
  data_file = open('intents.json').read()
  intents = json.loads(data_file)

This is what our intents.json file looks like.

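A minimal sketch of the expected structure is shown below; the tags, patterns, and responses here are illustrative placeholders, not the actual dataset contents:

  {
    "intents": [
      {
        "tag": "greeting",
        "patterns": ["Hi", "Hello", "How are you?"],
        "responses": ["Hello!", "Hi there, how can I help?"]
      },
      {
        "tag": "goodbye",
        "patterns": ["Bye", "See you later"],
        "responses": ["Goodbye!", "Talk to you soon."]
      }
    ]
  }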

2. Preprocess data

When working with text data, we need to perform various preprocessing on the data before we make a machine learning or a deep learning model. Tokenizing is the most basic and first thing you can do on text data. Tokenizing is the process of breaking the whole text into small parts like words.
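
For instance, a quick illustration of tokenizing (assuming the NLTK punkt data has been downloaded as described in the prerequisites):

  import nltk
  print(nltk.word_tokenize("Hi there! How are you?"))
  # ['Hi', 'there', '!', 'How', 'are', 'you', '?']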

Here we iterate through the patterns, tokenize each sentence using the nltk.word_tokenize() function, and append each word to the words list. We also create a list of classes for our tags.

  for intent in intents['intents']:
      for pattern in intent['patterns']:
          # tokenize each word
          w = nltk.word_tokenize(pattern)
          words.extend(w)
          # add documents in the corpus
          documents.append((w, intent['tag']))
          # add to our classes list
          if intent['tag'] not in classes:
              classes.append(intent['tag'])

Now we will lemmatize each word and remove duplicate words from the list. Lemmatizing is the process of converting a word into its lemma (base) form. We then create pickle files to store the Python objects which we will use while predicting.
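
For example (a small illustration, assuming the WordNet data has been downloaded):

  from nltk.stem import WordNetLemmatizer
  lemmatizer = WordNetLemmatizer()
  print(lemmatizer.lemmatize("dogs"))  # 'dog'
  print(lemmatizer.lemmatize("feet"))  # 'foot'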

  # lemmatize, lower each word and remove duplicates
  words = [lemmatizer.lemmatize(w.lower()) for w in words if w not in ignore_words]
  words = sorted(list(set(words)))
  # sort classes
  classes = sorted(list(set(classes)))
  # documents = combination between patterns and intents
  print(len(documents), "documents")
  # classes = intents
  print(len(classes), "classes", classes)
  # words = all words, vocabulary
  print(len(words), "unique lemmatized words", words)

  pickle.dump(words, open('words.pkl', 'wb'))
  pickle.dump(classes, open('classes.pkl', 'wb'))

3. Create training and testing data

Now, we will create the training data in which we will provide the input and the output. Our input will be the pattern, and the output will be the class our input pattern belongs to. But the computer doesn't understand text, so we will convert the text into numbers, as the toy example below illustrates.
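
A toy illustration of the bag-of-words encoding (the vocabulary and pattern below are invented for this example):

  # illustrative only: a tiny vocabulary and one tokenized pattern
  vocabulary = ['are', 'hi', 'how', 'you']
  pattern_words = ['hi', 'you']

  # 1 if the vocabulary word occurs in the pattern, else 0
  bag = [1 if w in pattern_words else 0 for w in vocabulary]
  print(bag)  # [0, 1, 0, 1]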

  # create our training data
  training = []
  # create an empty array for our output
  output_empty = [0] * len(classes)
  # training set, bag of words for each sentence
  for doc in documents:
      # initialize our bag of words
      bag = []
      # list of tokenized words for the pattern
      pattern_words = doc[0]
      # lemmatize each word - create base word, in attempt to represent related words
      pattern_words = [lemmatizer.lemmatize(word.lower()) for word in pattern_words]
      # create our bag of words array with 1, if word match found in current pattern
      for w in words:
          bag.append(1 if w in pattern_words else 0)
      # output is a '0' for each tag and '1' for current tag (for each pattern)
      output_row = list(output_empty)
      output_row[classes.index(doc[1])] = 1
      training.append([bag, output_row])

  # shuffle our features and turn into np.array
  random.shuffle(training)
  # dtype=object avoids a ragged-array error on newer NumPy versions
  training = np.array(training, dtype=object)
  # create train and test lists. X - patterns, Y - intents
  train_x = list(training[:, 0])
  train_y = list(training[:, 1])
  print("Training data created")


4. Build the model

We have our training data ready, so now we will build a deep neural network that has 3 layers, using the Keras Sequential API. After training the model for 200 epochs, it reached 100% accuracy on the training data. We save the model as 'chatbot_model.h5'.

  # Create model - 3 layers. First layer 128 neurons, second layer 64 neurons and 3rd output
  # layer contains number of neurons equal to number of intents, to predict the intent with softmax
  model = Sequential()
  model.add(Dense(128, input_shape=(len(train_x[0]),), activation='relu'))
  model.add(Dropout(0.5))
  model.add(Dense(64, activation='relu'))
  model.add(Dropout(0.5))
  model.add(Dense(len(train_y[0]), activation='softmax'))

  # Compile model. Stochastic gradient descent with Nesterov accelerated gradient gives good
  # results for this model (newer Keras versions use learning_rate instead of lr)
  sgd = SGD(lr=0.01, decay=1e-6, momentum=0.9, nesterov=True)
  model.compile(loss='categorical_crossentropy', optimizer=sgd, metrics=['accuracy'])

  # fitting and saving the model
  hist = model.fit(np.array(train_x), np.array(train_y), epochs=200, batch_size=5, verbose=1)
  model.save('chatbot_model.h5')  # save() takes only the file name; passing hist here was a bug
  print("model created")

5. Predict the response (Graphical User Interface)

Now, to predict sentences and get a response from the bot, let us create a new file, 'chatgui.py'.

We will load the trained model and then use a graphical user interface to present the bot's responses. The model will only tell us the class a message belongs to, so we will implement some functions which identify the class and then return a random response from that class's list of responses.

Again, we import the necessary packages and load the 'words.pkl' and 'classes.pkl' pickle files which we created when we trained our model:

  import nltk
  from nltk.stem import WordNetLemmatizer
  lemmatizer = WordNetLemmatizer()
  import pickle
  import numpy as np
  from keras.models import load_model
  model = load_model('chatbot_model.h5')
  import json
  import random

  intents = json.loads(open('intents.json').read())
  words = pickle.load(open('words.pkl', 'rb'))
  classes = pickle.load(open('classes.pkl', 'rb'))

To predict the class, we will need to provide input in the same way as we did while training. So we will create some functions that will perform text preprocessing and then predict the class.

  def clean_up_sentence(sentence):
      # tokenize the pattern - split the sentence into words
      sentence_words = nltk.word_tokenize(sentence)
      # lemmatize each word - reduce it to its base form
      sentence_words = [lemmatizer.lemmatize(word.lower()) for word in sentence_words]
      return sentence_words

  # return bag of words array: 0 or 1 for each word in the bag that exists in the sentence
  def bow(sentence, words, show_details=True):
      # tokenize the pattern
      sentence_words = clean_up_sentence(sentence)
      # bag of words - matrix of N words, vocabulary matrix
      bag = [0] * len(words)
      for s in sentence_words:
          for i, w in enumerate(words):
              if w == s:
                  # assign 1 if the current word is at this vocabulary position
                  bag[i] = 1
                  if show_details:
                      print("found in bag: %s" % w)
      return np.array(bag)

  def predict_class(sentence, model):
      # filter out predictions below a threshold
      p = bow(sentence, words, show_details=False)
      res = model.predict(np.array([p]))[0]
      ERROR_THRESHOLD = 0.25
      results = [[i, r] for i, r in enumerate(res) if r > ERROR_THRESHOLD]
      # sort by strength of probability
      results.sort(key=lambda x: x[1], reverse=True)
      return_list = []
      for r in results:
          return_list.append({"intent": classes[r[0]], "probability": str(r[1])})
      return return_list

After predicting the class, we will get a random response from the list of intents.

  def getResponse(ints, intents_json):
      tag = ints[0]['intent']
      list_of_intents = intents_json['intents']
      for i in list_of_intents:
          if i['tag'] == tag:
              result = random.choice(i['responses'])
              break
      return result

  def chatbot_response(text):
      ints = predict_class(text, model)
      if not ints:
          # no intent scored above the threshold, so fall back to a default reply
          return "Sorry, I don't understand."
      res = getResponse(ints, intents)
      return res
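
As a quick sanity check of the pipeline (the input string is illustrative; the actual reply depends on your own intents.json):

  # illustrative usage - the reply is drawn from your intents.json responses
  print(chatbot_response("Hello, how are you?"))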

Now we will code a graphical user interface. For this, we use the Tkinter library, which ships with Python. We will take the input message from the user and then use the helper functions we created to get the response from the bot and display it on the GUI. Here is the full source code for the GUI.

  # Creating GUI with tkinter
  from tkinter import *

  def send():
      msg = EntryBox.get("1.0", 'end-1c').strip()
      EntryBox.delete("1.0", END)

      if msg != '':
          ChatLog.config(state=NORMAL)
          ChatLog.insert(END, "You: " + msg + '\n\n')
          ChatLog.config(foreground="#442265", font=("Verdana", 12))

          res = chatbot_response(msg)
          ChatLog.insert(END, "Bot: " + res + '\n\n')

          ChatLog.config(state=DISABLED)
          ChatLog.yview(END)

  base = Tk()
  base.title("Hello")
  base.geometry("400x500")
  base.resizable(width=FALSE, height=FALSE)

  # Create Chat window
  ChatLog = Text(base, bd=0, bg="white", height="8", width="50", font="Arial")
  ChatLog.config(state=DISABLED)

  # Bind scrollbar to Chat window
  scrollbar = Scrollbar(base, command=ChatLog.yview, cursor="heart")
  ChatLog['yscrollcommand'] = scrollbar.set

  # Create Button to send message
  SendButton = Button(base, font=("Verdana", 12, 'bold'), text="Send", width="12", height=5,
                      bd=0, bg="#32de97", activebackground="#3c9d9b", fg='#ffffff',
                      command=send)

  # Create the box to enter message
  EntryBox = Text(base, bd=0, bg="white", width="29", height="5", font="Arial")
  # EntryBox.bind("<Return>", send)

  # Place all components on the screen
  scrollbar.place(x=376, y=6, height=386)
  ChatLog.place(x=6, y=6, height=386, width=370)
  EntryBox.place(x=128, y=401, height=90, width=265)
  SendButton.place(x=6, y=401, height=90)

  base.mainloop()

6. Run the chatbot

To run the chatbot, we have two main files: train_chatbot.py and chatgui.py.

First, we train the model using the command in the terminal:

  python train_chatbot.py

If we don't see any errors during training, we have successfully created the model. Then, to run the app, we run the second file:

  python chatgui.py

The program will open up a GUI window within a few seconds. With the GUI you can easily chat with the bot.

Screenshots: the chatbot GUI and a sample chat between the user and the bot.

Summary

In this Python data science project, we learned about chatbots and implemented a deep learning chatbot in Python. You can customize the data according to business requirements and train the chatbot with great accuracy. Chatbots are used everywhere, and many businesses are looking forward to integrating bots into their workflows.