Why does only LLMEmbedder prefix encode with an underscore, while the earlier models do not? #1114

dream-tentacle · 2024-09-20T06:49:20Z

FlagEmbedding/FlagEmbedding/flag_models.py

Line 516 in 274f4c0

 def _encode(self, sentences: Union[List[str], str], batch_size: int = 256, max_length: int = 512) -> np.ndarray: 

Should the underscore be removed for consistency?

ZiyiXia · 2024-09-21T13:34:06Z

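In Python naming conventions, a function whose name starts with a single underscore is generally meant to be called by other internal functions and is not used directly as part of the API.
See PEP 8 – Style Guide for Python Code: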

_single_leading_underscore: weak “internal use” indicator. E.g. from M import * does not import objects whose names start with an underscore.
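The wildcard-import behaviour quoted from PEP 8 can be checked with a small sketch. The module name `demo_mod` and its two functions are hypothetical and exist only for this illustration; they are not part of FlagEmbedding.

```python
import sys
import types

# Build a throwaway in-memory module (hypothetical name "demo_mod")
# with one public function and one underscore-prefixed helper.
demo_mod = types.ModuleType("demo_mod")
exec(
    "def encode(x):\n"
    "    return _encode(x)\n"
    "def _encode(x):\n"
    "    return x.upper()\n",
    demo_mod.__dict__,
)
sys.modules["demo_mod"] = demo_mod

# Run a wildcard import into a fresh namespace and see what arrives.
namespace = {}
exec("from demo_mod import *", namespace)

# Ignore dunder entries such as __builtins__ that exec() adds.
public_names = sorted(n for n in namespace if not n.startswith("__"))
print(public_names)

# The underscore is only a weak signal: explicit access still works.
print(demo_mod._encode("still callable"))
```

Only `encode` reaches the importing namespace, while `_encode` stays reachable through the module object. The underscore signals intent; it does not enforce privacy.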

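LLMEmbedder was fine-tuned on 6 specific tasks, each with its own query instruction and key instruction, so at encoding time a different instruction must be chosen for queries and keys depending on the task. We recommend calling encode_queries() and encode_keys() directly (both of them call _encode() internally). For usage, see the LLMEmbedder README.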
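The pattern described here, public task-aware wrappers over a private encoder, can be sketched roughly as follows. This is not the FlagEmbedding implementation: the class `ToyEmbedder`, the helper `_with_instruction`, the task names, the instruction strings, and the fake length-based "embedding" are all assumptions made for illustration.

```python
from typing import Dict, List, Union


class ToyEmbedder:
    """Illustrative stand-in; not the FlagEmbedding LLMEmbedder class."""

    # Placeholder task names and instruction strings (assumptions).
    instructions: Dict[str, Dict[str, str]] = {
        "qa": {"query": "Represent this query: ", "key": "Represent this document: "},
        "chat": {"query": "Embed this dialogue: ", "key": "Embed this response: "},
    }

    def encode_queries(self, queries: Union[List[str], str], task: str = "qa") -> List[List[float]]:
        """Public entry point: prepend the task's query instruction."""
        return self._encode(self._with_instruction(queries, self.instructions[task]["query"]))

    def encode_keys(self, keys: Union[List[str], str], task: str = "qa") -> List[List[float]]:
        """Public entry point: prepend the task's key instruction."""
        return self._encode(self._with_instruction(keys, self.instructions[task]["key"]))

    @staticmethod
    def _with_instruction(sentences: Union[List[str], str], instruction: str) -> List[str]:
        # Accept a single string or a list, and prefix each item.
        if isinstance(sentences, str):
            sentences = [sentences]
        return [instruction + s for s in sentences]

    def _encode(self, sentences: List[str]) -> List[List[float]]:
        # Internal helper: a real model would tokenize and run a forward pass.
        # Here each "embedding" is a fake 1-d vector holding the text length.
        return [[float(len(s))] for s in sentences]


embedder = ToyEmbedder()
query_vecs = embedder.encode_queries("what is PEP 8?", task="qa")
key_vecs = embedder.encode_keys(["PEP 8 is a style guide."], task="qa")
```

Calling `_encode()` directly would skip the instruction prefix, which is why a leading underscore is a reasonable way to steer callers toward the two wrappers.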

dream-tentacle · 2024-09-22T01:50:35Z

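@ZiyiXia Thanks, but I was comparing it with the earlier models (FlagLLMModel, FlagModel). Their interfaces all look much the same, so if that is the reasoning, shouldn't those two also get the underscore? Thanks!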

dream-tentacle closed this as completed Sep 22, 2024

dream-tentacle reopened this Sep 22, 2024
