Update README.md
Browse files
README.md
CHANGED
@@ -78,6 +78,8 @@ The model is intended for commercial use for Java programming tasks. The model p
|
|
78 |
3. Code generation/Completion task in Java
|
79 |
4. FIM task in Java
|
80 |
|
|
|
|
|
81 |
### Generation
|
82 |
```Java
|
83 |
# pip install -q transformers
|
@@ -93,7 +95,17 @@ inputs = tokenizer.encode("public class HelloWorld {\n public static void mai
|
|
93 |
outputs = model.generate(inputs)
|
94 |
print(tokenizer.decode(outputs[0]))
|
95 |
```
|
96 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
97 |
* _Using 8-bit precision (int8)_
|
98 |
|
99 |
```java
|
|
|
78 |
3. Code generation/Completion task in Java
|
79 |
4. FIM task in Java
|
80 |
|
81 |
+
## Sample inference code
|
82 |
+
|
83 |
### Generation
|
84 |
```Java
|
85 |
# pip install -q transformers
|
|
|
95 |
outputs = model.generate(inputs)
|
96 |
print(tokenizer.decode(outputs[0]))
|
97 |
```
|
98 |
+
### Fill-in-the-middle
|
99 |
+
Fill-in-the-middle uses special tokens to identify the prefix/middle/suffix part of the input and output:
|
100 |
+
|
101 |
+
```Java
|
102 |
+
input_text = "<fim_prefix>public class PalindromeChecker {\n public static boolean isPalindrome(String str) {\n <fim_suffix>return true;\n }\n<fim_middle>"
|
103 |
+
inputs = tokenizer.encode(input_text, return_tensors="pt").to(device)
|
104 |
+
outputs = model.generate(inputs)
|
105 |
+
print(tokenizer.decode(outputs[0]))
|
106 |
+
```
|
107 |
+
|
108 |
+
### Quantized Versions through `bitsandbytes`
|
109 |
* _Using 8-bit precision (int8)_
|
110 |
|
111 |
```java
|