casperhansen
/

mpt-7b-8k-chat-awq

Text Generation

text-generation-inference

4-bit precision

Model card Files Files and versions Community

mpt-7b-8k-chat-awq / custom_embedding.py

casperhansen's picture

MPT 7B 8K quantized

5c660fe 11 months ago

raw history blame contribute delete

No virus

305 Bytes

	import torch
	import torch.nn as nn
	import torch.nn.functional as F
	from torch import Tensor

	class SharedEmbedding(nn.Embedding):

	def forward(self, input: Tensor, unembed: bool=False) -> Tensor:
	if unembed:
	return F.linear(input, self.weight)
	return super().forward(input)