Lightweight reranking model based on Google Gemma 2 architecture with 2.6B parameters, optimized for Chinese and English, runs on consumer-grade GPUs.