147
Views
0
CrossRef citations to date
0
Altmetric
Research Article

A large-scale Chinese patent dataset for information extraction

, &
Article: 2365328 | Received 17 Jan 2024, Accepted 30 May 2024, Published online: 13 Jun 2024

Figures & data

Figure 1. The correspondence between entity and relation.

Figure 1. The correspondence between entity and relation.

Table 1. Entity types and relation types.

Table 2. Comparison of annotation tools.

Figure 2. Statistics of entity labels.

Figure 2. Statistics of entity labels.

Figure 3. Statistics of relation labels.

Figure 3. Statistics of relation labels.

Figure 4. Example of a sample (Named entity identification).

Figure 4. Example of a sample (Named entity identification).

Figure 5. Example of a sample (Relationship classification).

Figure 5. Example of a sample (Relationship classification).

Figure 6. Flowchart of the patent information extraction.

Figure 6. Flowchart of the patent information extraction.

Figure 7. The framework of the BiLSTM based NER method.

Figure 7. The framework of the BiLSTM based NER method.

Table 3. NER results of Bi-LSTM models.

Table 4. RE experimental results.

Table 5. SPO experimental results.

Data availability statement

The data can be available from the corresponding author upon reasonable request.