ToddGoldfarb committed
Commit f3970b8 · 1 Parent(s): 77885a5
Update README.md

README.md CHANGED

language:
- en
pipeline_tag: conversational
---

# What is Cadet-Medium?

Inspired by Allen AI's **Cosmo-XL**, **Cadet-Medium** is a relatively small conversational model trained on the **SODA** dataset. **Cadet-Medium** is intended for inference at the edge, on hardware as small as a Raspberry Pi with 2 GB of RAM.

**Cadet-Medium** is fine-tuned from Google's **t5-base** pretrained model.
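
If you want a quick smoke test before wiring up the full agent below, here is a minimal single-turn sketch. The `transformers` calls mirror the snippet on this card, but the bare-string prompt is my own simplification: the full agent formats the situation, role, and dialogue history into the input, so treat the exact prompt format here as an assumption.

```
# Minimal single-turn smoke test (a sketch -- the bare prompt is a
# simplification of the situation/role/history formatting the agent uses).
import torch
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
tokenizer = AutoTokenizer.from_pretrained("t5-base", model_max_length=512)
model = AutoModelForSeq2SeqLM.from_pretrained("ToddGoldfarb/Cadet-Medium").to(device)

inputs = tokenizer("Hello! How are you doing today?", return_tensors="pt").to(device)
outputs = model.generate(**inputs, max_new_tokens=64, do_sample=True, top_p=0.95)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```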

If you have any questions, or any comments on improvements, please contact me at: **tcgoldfarb@gmail.com**

Here is the link to the Google Colab file, where I walk through the process of training the model and using the SODA public dataset from AI2.

https://colab.research.google.com/drive/1uekZ0gO3GqjPwno16tV1A4Gitrl7p3ur?usp=sharing
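
If you would rather poke at the training data itself, SODA is public on the Hugging Face Hub. A minimal sketch, assuming the `datasets` library and the `allenai/soda` dataset id:

```
# Peek at a SODA training example (assumes: pip install datasets).
from datasets import load_dataset

soda = load_dataset("allenai/soda", split="train")
print(soda[0])  # one example: a narrative plus a multi-turn dialogue
```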

# Get Started With Cadet-Medium

Use the code snippet below to get started with Cadet-Medium!

```
import torch
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
import colorful as cf

cf.use_true_colors()
cf.use_style('monokai')

class CadetMedAgent:
    def __init__(self):
        print(cf.bold | cf.purple("Waking up Cadet-Medium..."))
        self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
        self.tokenizer = AutoTokenizer.from_pretrained("t5-base", model_max_length=512)
        self.model = AutoModelForSeq2SeqLM.from_pretrained("ToddGoldfarb/Cadet-Medium", low_cpu_mem_usage=True).to(self.device)
        self.conversation_history = ""

    def observe(self, observation):
        # (body unchanged by this commit and omitted from the diff, along with
        # the generate and reset_history helpers -- the Colab has the full script)
        ...
    def run(self):
        # (the get_valid_input helper and the top of this method are unchanged
        # and omitted from the diff)
        while True:
            continue_chat = ""

            # MODIFY THESE STRINGS TO YOUR LIKING :)
            situation_narrative = "Imagine you are Cadet-Medium talking to ???."
            role_instruction = "You are Cadet-Medium, and you are talking to ???."

            self.chat(situation_narrative, role_instruction)
            continue_chat = get_valid_input(cf.purple("Start a new conversation with new setup? [Y/N]:"), "Y")
            if continue_chat in ["N", "n"]:
                break

        print(cf.blue("CM: See you!"))
    def chat(self, situation_narrative, role_instruction):
        print(cf.green(
            "Cadet-Medium is running! Input [RESET] to reset the conversation history and [END] to end the conversation."))
        while True:
            user_input = input("You: ")
            if user_input == "[RESET]":
                self.reset_history()
                print(cf.green("[Conversation history cleared. Chat with Cadet-Medium!]"))
                continue
            if user_input == "[END]":
                break
            response = self.generate(situation_narrative, role_instruction, user_input)
            print(cf.blue("CM: " + response))

def main():
    print(cf.bold | cf.blue("LOADING MODEL"))

    CadetMed = CadetMedAgent()
    CadetMed.run()

if __name__ == '__main__':
    main()
```
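
A note on dependencies: the snippet needs `torch`, `transformers`, and `colorful` (`pip install torch transformers colorful`). The helper methods it references but does not show (`observe`, `generate`, `reset_history`, and `get_valid_input`) are unchanged by this commit; the complete script is in the Colab notebook linked above.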