-
Notifications
You must be signed in to change notification settings - Fork 38
[LLM Runtime] Enable phi-2&phi-1.5&phi-1 #78
Conversation
Signed-off-by: intellinjun <[email protected]>
Signed-off-by: intellinjun <[email protected]>
Signed-off-by: intellinjun <[email protected]>
Signed-off-by: intellinjun <[email protected]>
Signed-off-by: intellinjun <[email protected]>
Signed-off-by: intellinjun <[email protected]>
Signed-off-by: intellinjun <[email protected]>
Signed-off-by: intellinjun <[email protected]>
Signed-off-by: intellinjun <[email protected]>
Signed-off-by: intellinjun <[email protected]>
Signed-off-by: intellinjun <[email protected]>
Signed-off-by: intellinjun <[email protected]>
Signed-off-by: intellinjun <[email protected]>
Signed-off-by: intellinjun <[email protected]>
Signed-off-by: intellinjun <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Could you post the phi2’s performance data here ? |
please also change https://github.com/intel/neural-speed/blob/main/docs/supported_models.md |
Does its model architecture have any differences compared with other GPT-liked models ( |
Signed-off-by: intellinjun <[email protected]>
use partial rope with parameter "partial_rotary_factor" |
done |
how about the performance? |
|
here is performance test result |
<style> </style>
|
Type of Change
feature or bug fix or documentation or others
API changed or not
not
Description
Model enabling
detail description