`dpr-question_encoder-multiset-base`

Model Details

Model Description: Dense Passage Retrieval (DPR) is a set of tools and models for state-of-the-art open-domain Q&A research. dpr-question_encoder-multiset-base is the question encoder trained using the Natural Questions (NQ) dataset, TriviaQA, WebQuestions (WQ), and CuratedTREC (TREC).

Developed by: See GitHub repo for model developers
Model Type: BERT-based encoder
Language(s): CC-BY-NC-4.0, also see Code of Conduct
License: English
Related Models:
- dpr-ctx_encoder-multiset-base
- dpr-reader-multiset-base
- dpr-ctx_encoder-single-nq-base
- dpr-question_encoder-single-nq-base
- dpr-reader-single-nq-base
Resources for more information:
- Research Paper
- GitHub Repo
- Hugging Face DPR docs
- BERT Base Uncased Model Card

How to Get Started with the Model

Use the code below to get started with the model.

from Transformers import DPRQuestionEncoder, DPRQuestionEncoderTokenizer
tokenizer = DPRQuestionEncoderTokenizer.from_pretrained("facebook/dpr-question_encoder-multiset-base")
model = DPRQuestionEncoder.from_pretrained("facebook/dpr-question_encoder-multiset-base")
input_ids = tokenizer("Hello, is my dog cute ?", return_tensors="pt")["input_ids"]
embeddings = model(input_ids).pooler_output

Uses

Direct Use

dpr-question_encoder-multiset-base, dpr-ctx_encoder-multiset-base, and dpr-reader-multiset-base can be used for the task of open-domain question answering.

Misuse and Out-of-scope Use

The model should not be used to intentionally create hostile or alienating environments for people. In addition, the set of DPR models was not trained to be factual or true representations of people or events, and therefore using the models to generate such content is out-of-scope for the abilities of this model.

Risks, Limitations and Biases

CONTENT WARNING: Readers should be aware this section may contain content that is disturbing, offensive, and can propogate historical and current stereotypes.

Significant research has explored bias and fairness issues with language models (see, e.g., Sheng et al., 2021 and Bender et al., 2021). Predictions generated by the model can include disturbing and harmful stereotypes across protected classes; identity characteristics; and sensitive, social, and occupational groups.

Training

Training Data

This model was trained using the following datasets:

Natural Questions (NQ) dataset (Lee et al., 2019; Kwiatkowski et al., 2019)
TriviaQA (Joshi et al., 2017)
WebQuestions (WQ) (Berant et al., 2013)
CuratedTREC (TREC) (Baudiš & Šedivý, 2015)

Training Procedure

The training procedure is described in the associated paper:

Given a collection of M text passages, the goal of our dense passage retriever (DPR) is to index all the passages in a low-dimensional and continuous space, such that it can retrieve efficiently the top k passages relevant to the input question for the reader at run-time.

Our dense passage retriever (DPR) uses a dense encoder EP(·) which maps any text passage to a d- dimensional real-valued vectors and builds an index for all the M passages that we will use for retrieval. At run-time, DPR applies a different encoder EQ(·) that maps the input question to a d-dimensional vector, and retrieves k passages of which vectors are the closest to the question vector.

The authors report that for encoders, they used two independent BERT (Devlin et al., 2019) networks (base, un-cased) and use FAISS (Johnson et al., 2017) during inference time to encode and index passages. See the paper for further details on training, including encoders, inference, positive and negative passages, and in-batch negatives.

Evaluation

The following evaluation information is extracted from the associated paper.

Testing Data, Factors and Metrics

The model developers report the performance of the model on five QA datasets, using the top-k accuracy (k ∈ {20, 100}). The datasets were NQ, TriviaQA, WebQuestions (WQ), CuratedTREC (TREC), and SQuAD v1.1.

Results

Top 20	Top 100
NQ	TriviaQA	WQ	TREC	SQuAD	NQ	TriviaQA	WQ	TREC	SQuAD
79.4	78.8	75.0	89.1	51.6	86.0	84.7	82.9	93.9	67.6

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019). We present the hardware type and based on the associated paper.

Hardware Type: 8 32GB GPUs
Hours used: Unknown
Cloud Provider: Unknown
Compute Region: Unknown
Carbon Emitted: Unknown

Technical Specifications

See the associated paper for details on the modeling architecture, objective, compute infrastructure, and training details.

Citation Information

  @inproceedings{karpukhin-etal-2020-dense,
    title = "Dense Passage Retrieval for Open-Domain Question Answering",
    author = "Karpukhin, Vladimir and Oguz, Barlas and Min, Sewon and Lewis, Patrick and Wu, Ledell and Edunov, Sergey and Chen, Danqi and Yih, Wen-tau",
    booktitle = "Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)",
    month = nov,
    year = "2020",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2020.emnlp-main.550",
    doi = "10.18653/v1/2020.emnlp-main.550",
    pages = "6769--6781",
}

Model Card Authors

This model card was written by the team at Hugging Face.

facebook/dpr-question_encoder-multiset-base浏览人数已经达到522，如你需要查询该站的相关权重信息，可以点击"5118数据""爱站数据""Chinaz数据"进入；以目前的网站数据参考，建议大家请以爱站数据为准，更多网站价值评估因素如：facebook/dpr-question_encoder-multiset-base的访问速度、搜索引擎收录以及索引量、用户体验等；当然要评估一个站的价值，最主要还是需要根据您自身的需求以及需要，一些确切的数据则需要找facebook/dpr-question_encoder-multiset-base的站长进行洽谈提供。如该站的IP、PV、跳出率等！

特别声明

本站Ai导航提供的facebook/dpr-question_encoder-multiset-base都来源于网络，不保证外部链接的准确性和完整性，同时，对于该外部链接的指向，不由Ai导航实际控制，在2023年5月9日下午7:15收录时，该网页上的内容，都属于合规合法，后期网页的内容如出现违规，可以直接联系网站管理员进行删除，Ai导航不承担任何责任。

Ai导航致力于优质、实用的网络站点资源收集与分享！本文地址https://www.ainavpro.com/sites/3032.html转载请注明

sonoisa/sentence-bert-base-ja-mean-tokens

This is a Japanese sentence...

allegro/herbert-large-cased

HerBERT HerBERT is a BERT...

google/vit-base-patch16-224-in21k

Vision Transformer (base-si...

megagonlabs/transformers-ud-japanese-electra-base-ginza-510

transformers-ud-japanese-el...

voidism/diffcse-bert-base-uncased-sts

DiffCSE: Difference-based C...

microsoft/wavlm-large

WavLM-Large Microsoft's W...

暂无评论

暂无评论...

facebook/dpr-question_encoder-multiset-base

`dpr-question_encoder-multiset-base`

Table of Contents

Model Details

How to Get Started with the Model

Uses

Direct Use

Misuse and Out-of-scope Use

Risks, Limitations and Biases

Training

Training Data

Training Procedure

Evaluation

Testing Data, Factors and Metrics

Results

Environmental Impact

Technical Specifications

Citation Information

Model Card Authors

数据统计

数据评估

相关导航

暂无评论

热门标签

随机网址

facebook/dpr-question_encoder-multiset-base

dpr-question_encoder-multiset-base

Table of Contents

Model Details

How to Get Started with the Model

Uses

Direct Use

Misuse and Out-of-scope Use

Risks, Limitations and Biases

Training

Training Data

Training Procedure

Evaluation

Testing Data, Factors and Metrics

Results

Environmental Impact

Technical Specifications

Citation Information

Model Card Authors

数据统计

数据评估

相关导航

暂无评论

热门标签

随机网址

广告位

`dpr-question_encoder-multiset-base`