Search in:

Applied Artificial Intelligence

An International Journal

Volume 36, 2022 - Issue 1

Submit an article Journal homepage

Open access

1,997

Views

CrossRef citations to date

Altmetric

Listen

Research Article

Development and Evaluation of an Intelligent System for Calibrating Karaoke Lyrics Based on Fuzzy Petri Nets

Yi-Nan Lina Department of Electronic Engineering, Ming Chi University of Technology, New Taipei City, Taiwan, ROCView further author information

Cheng-Ying Yangb Department of Computer Science, University of Taipei, Taipei, Taiwan, ROCView further author information

Sheng-Kuan Wangc Department of Electrical Engineering, Ming Chi University of Technology, New Taipei City, Taiwan, ROCView further author information

Gwo-Jen Chioud Department of Electrical Engineering, National Formosa University, Yunlin County, Taiwan, ROCView further author information

Victor R.L. Shene Department of Computer Science and Information Engineering, National Taipei University, New Taipei City, Taiwan, ROC;g Department of Information Management, Chaoyang University of Technology, Taichung City, Taiwan, ROCCorrespondence[email protected] [email protected]
View further author information

Yi-Chih Tunga Department of Electronic Engineering, Ming Chi University of Technology, New Taipei City, Taiwan, ROCView further author information

Frank H.C. Shenf Department of Electronic Engineering, Fu Jen Catholic University, New Taipei City, Taiwan, ROCView further author information

Hung-Chi Chenge Department of Computer Science and Information Engineering, National Taipei University, New Taipei City, Taiwan, ROCView further author information

show all

Article: 2110699 | Received 30 May 2022, Accepted 03 Aug 2022, Published online: 22 Aug 2022

Cite this article
https://doi.org/10.1080/08839514.2022.2110699
CrossMark

In this article

ABSTRACT
Introduction
Literature Review
Proposed Approach
Experimental Results
Conclusion
Acknowledgements
Disclosure statement
Additional information
References

Full Article
Figures & data
References
Citations
Metrics
Licensing
Reprints & Permissions
View PDF PDF View EPUB EPUB

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

ABSTRACT

In the home entertainment system, karaoke is a popular leisure facility in our daily life. Via the karaoke system, users can sing along with the lyrics based on the recordings of pop songs. However, a lot of karaoke systems can display lyrics semi-automatically. Traditionally, some lyrics are input manually and need to be synchronized with the tonal music stepwise, which is time-consuming. One of the famous musical phrase segmentation theories is a generative theory of tonal music, through which we have implemented a karaoke system in C# programming language. This intelligent system can automatically segment music phrases and use a high-level fuzzy Petri net model to calibrate the lyrics in pop songs. Fifty Chinese pop songs are selected to evaluate its performance. The experimental results have shown that the average calibration precision value (92.78%) and recall value (90.46%) are highly acceptable.

Introduction

In the home entertainment system, karaoke playing has become a popular activity, which is enjoyed by many people. Via a karaoke system, users can sing along with lyrics based on the recordings of pop songs, and receive a score based on their performances (Tsai and Lee Citation2017). When reading the karaoke lyrics on the screen, users can recognize how to adjust their singing intonation to improve their scores, which is really simple and interesting. However, a difficulty in the traditional karaoke lyrics to be calibrated manually is encountered. It takes a large amount of time to segment the musical phrases, which is quite inefficient. Karaoke songs have become more difficult to make and the manufacturing costs are increasing. Thus, the selling price of a karaoke system is harder to be accepted by users.

In recent years, with advances in computer technology, the music albums have gradually been replaced by digital music purchased online. Nowadays, researchers suggest that automatically calibrating karaoke lyrics may be possible. Firstly, the musical phrase segmentation is required. One of the famous music segmentation theories is a generative theory of tonal music (GTTM) (Chan and Hsiao Citation2016; Frankland and Cohen Citation2018), which was conceived by a music theorist, Fred Lerdahl, and a linguist, Ray Jackendoff; and presented in the book with the same title. Grouping preference rules (GPRs) are established in GTTM for the purpose of making a music grouping structure (Goto Citation2016). Quantification and testing of the local grouping rules for four local GPRs are provided (Frankland and Cohen Citation2018). In the approach based on the preference rules, the sound onset detection is applied to find all onsets in the song (Wu and Cao Citation2018).

The high-level fuzzy Petri net (HLFPN) model (Chiang et al. Citation2020; Citation2021; Lin et al. Citation2020a) is now used to perform the music segmentation. It is based on Petri net theory and fuzzy reasoning. Petri net theory, proposed by Dr Carl Adam Petri in 1962, is a graphical and mathematical modeling tool, which is concurrent, asynchronous, distributed, parallel, non-deterministic, stochastic, and so on. It can be used to model and analyze various systems (Sung and Chang Citation2019). However, along with the advances in information system, the description of Petri nets is becoming more and more complex with use of fuzzy production rules (Matsumoto and Hori Citation2019). In general, a fuzzy production rule can be used to describe the fuzzy relationship between antecedent and consequent.

Due to the above reasons and motivation, the goal of this paper is to make a karaoke system which can automatically calibrate karaoke songs with lyrics. A C# program has been designed to implement GTTM and HLFPN model for the purpose of segmenting each musical phrase. Musical phrases are used to automatically calibrate the lyrics that form a database system, and music notes are automatically calibrated for a Chinese character which has only one syllable. Then, an automatic calibration system for Chinese karaoke lyrics is completed.

Problem Statement: Although the karaoke technology has advanced with multi-functions for many years, it still lacks a robust database system for musical analysis. The grouping preference rules are diversified (Frankland and Cohen Citation2018). Also, the technical analysis parameters are hard to make musical decision. Consequently, the original semi-automatic lyrics calibration system needs to be further improved.

The remainder of this paper is organized as follows: Section 2 discusses the literature review of GTTM which is applied to fuzzy reasoning of HLFPN model. The system architecture and music segmentation are described in Section 3. The experimental results are discussed in Section 4. Finally, the conclusion is remarked in Section 5.

Literature Review

First, the background knowledge of a generative theory of tonal music (GTTM) rules and the musical context combination method are described. Second, some basic definitions regarding the HLFPN model, and the related works are presented.

Generative Theory of Tonal Music (GTTM)

The traditional method of musical phrase segmentation is based on psychological perspectives, and their relationship is discovered by psychology in music grouping. In 1920s, German psychologists proposed Gestalt theory to group similar objects. In Gestalt theory, they proposed the cognitive psychological aspects of musical groups, including three basic principles, namely, principle of proximity, principle of similarity, and principle of continuation.

In Gestalt theory, the research work in music grouping, composed of Jackendoff‘s and Lerdale’s ideas, proposed a generative theory of tonal music (GTTM) (Chan and Hsiao Citation2016; Frankland and Cohen Citation2018). GTTM constitutes a formal description of the musical intuitions of a listener who is experienced in a musical idiom. It includes the grouping preference rules (GPRs) defined as follows:

Rule 1: Single Event: “Avoid analyses with very small groups- the smaller, the less preferable.”

Rule 2: Proximity: “Consider a sequence of four notes, n1 – n4, the transition n2 – n3 may be heard as a group boundary if A. Slur/Rest: “the time interval from the end of n2 is greater than that from the end of n1 to the beginning of n2 and that from the end of n3 to the beginning of n4; or if B. Attack-Point: “the time interval between the attack points of n2 and n3 is greater than that between the attack points of n1 and n2 and that between the attack points of n3 and n4.”

Rule 3: Change: “Consider a sequence of four notes, n1 – n4, the transition n2 – n3 may be heard as a group boundary if marked by Register, Dynamics, Articulation, Length, and Timbre,” which are shown in .

Figure 1. Register change.

Figure 2. Dynamics change.

Figure 3. Articulation change.

Figure 4. Length change.

Figure 5. Timbre change.

High-Level Fuzzy Petri Net

Based on the original Petri net theory, scholars conduct their research works with evolutionary Petri nets one after another, such as colored Petri nets (Zhang, Zhao, and He Citation2022), timed Petri nets (Lin et al., Citation2020b; Lin et al. Citation2022), fuzzy Petri nets (Chowdhury et al. Citation2019), high-level fuzzy Petri nets (Chiang et al. Citation2020; Lin et al. Citation2020a). The HLFPN model was adopted to make decision on the segmentation of music phrases. It provides the characters of Petri net and fuzzy logic theories, which can be used to model fuzzy production rules and to conduct fuzzy reasoning (Chiang et al. Citation2021). The basic definitions and fuzzy reasoning approach are presented as follows:

Definition 1. The HLFPN is defined as an 8-tuple:

H L F P N = (P, T, F, C, V, α, β, δ)

where:

$P = \{p_{1}, p_{2}, \dots, p_{k}\}$ : a finite set of places.

$T = \{t_{1}, t_{2}, \dots, t_{l}\}$ : a finite set of transitions.

P \cup T = \emptyset

$F \subseteq (P \times T) \cup (T \times P)$ : called the flow relation which is also a finite set of arcs, each representing the fuzzy set (i.e. fuzzy term) for an antecedent or a consequent; where the positive arcs (i.e. THEN parts) are denoted by →.

$C = \{X, Y, Z\}$ : A finite set of linguistic variables, e.g. $X, Y$ , and $Z$ , where $X = \{x_{1}, x_{2}, \dots, x_{n}\}$ , $Y = \{y_{1}, y_{2}, \dots, y_{m}\}$ , $Z = \{z_{1}, z_{2}, \dots, z_{q}\}$ .

$V = \{v_{1}, v_{2}, \dots, v_{l}\}$ : A finite set of fuzzy truth values known as the fuzzy relational matrix between the antecedent and the consequent of a fuzzy production rule.

$α : P \to C$ : An association function, mapping from places to linguistic variables. $α (p_{i}) = c_{i}, i = 1, \dots, I$ , where $C = c_{i}$ is a set of linguistic variables in the knowledge base (KB) and is the number of linguistic variables in the KB;

$β : F \to \{0, 1\}$ : An association function, mapping from the flow relations to the fuzzy truth values between zero and one.

$δ : T \to V$ : An association function, mapping from transitions to fuzzy relational matrices.

Definition 2. Input and Output Functions:

$I (t) = {p \in P | (p, t) \in F}$ : A set of input places of transition $t$ .

$I (p) = {t \in T | (t, p) \in F}$ : A set of input transitions of place $p$ .

$O (t) = {p \in P | (t, p) \in F}$ : A set of output places of transition $t$ .

$O (p) = {t \in T | (p, t) \in F}$ : A set of output transitions of place $p$ .

Definition 3. Membership Function:

The mapping function $M e m (p) : P \to [0, 1]$ assigns each place a real value, where $M e m (p) = D O M (α (p))$ , $D O M$ represents the degree of membership in the associated proposition and data tokens are available in a set of places $P$ .

Definition 4. Max-Min Compositional Rule:

In the HLFPN model, $\forall$ transition t, V(t) = min (fuzzy sets in I(t)); $\forall$ place p, V(p) = max (fuzzy sets in I(p)). The Max-Min composition operator is denoted by ○.

Definition 5. Input Place, Hidden Place, and Output Place:

In the HLFPN model, $\forall$ place p_i $\in$ P, if $\forall$ t_j $\in$ T, p_i $\notin$ O(t_j), then p_i is called an input place (IP) of t_j. if $\forall$ t_j $\in$ T, p_i $\notin$ I(t_j), then p_i is called an output place (OP) of t_j; otherwise, p_i is called a hidden place.

Fuzzy Reasoning

For the fuzzy reasoning, fuzzy production rules are used (Lin et al. Citation2020a). In general, a fuzzy production rule is used to describe fuzzy relationship between the antecedent and the consequent. Let R be a set of fuzzy production rules, where $R = \{R_{1}, R_{2}, \dots, R_{n}\}$ . The general form of the i-th fuzzy production rule $R_{i}$ is shown as follows:

R_{i} : I F d_{j} (X i s A), T H E N d_{k} (Y i s B); E L S E d_{w} (Z i s C) \dots (V)

where $d_{j}, d_{k}, a n d d_{w}$ denote propositions; $X$ is called the input linguistic variable; $Y$ and $Z$ are called the output linguistic variables, respectively; $A$ is called the input fuzzy set; $B$ and $C$ are called the output fuzzy sets, respectively; the fuzzy truth values of the propositions “ $X$ is $A$ ”, “ $Y$ is $B$ ” and “ $Z$ is $C$ ” are restricted to [0, 1]; $d_{j}$ is the antecedent of a fuzzy production rule R_i; d_k and d_w denote the consequents of the fuzzy production rule $R_{i}$ . Let $V$ represent the fuzzy relational matrix between antecedent and consequent of a fuzzy production rule.

Illustrative Example:

Let us consider the fuzzy production rule $R_{1}$ shown as follows:

$R_{1} : I F t h e t e m p e r a t u r e (X_{1}) i s h o t (A_{1}) A N D t h e s k y (X_{2}) i s c l o u d y (A_{2}),$ $T H E N t h e h u m i d i t y (Y) i s h i g h (B) .$

Based on the transformation procedure presented in (Casey et al. Citation2018), the above fuzzy production rule $R_{1}$ is transformed into the following first-order logic form:

R_{1}^{'} : I F X_{1} (A_{1}) A N D X_{2} (A_{2}), T H E N Y (B)

Then, the HLFPN model is shown in .

Figure 6. HLFPN model of illustrative example.

Assume that the fuzzy sets $A_{1}$ , $A_{2}$ and $B$ are shown as follows:

A_{1} = \frac{0.20}{a_{11}} + \frac{0.54}{a_{12}} + \frac{0.24}{a_{13}}

A_{2} = \frac{0.11}{a_{21}} + \frac{0.77}{a_{22}} + \frac{0.42}{a_{23}}

B = \frac{0.31}{b_{1}} + \frac{0.66}{b_{2}} + \frac{0.17}{b_{3}}

By the cylindrical extension operations (Lin et al. Citation2020a), that is, a Cartesian product, we can obtain the antecedent fuzzy set A, shown as follows:

A = A_{1} \times A 2 = (0.20 0.54 0.24)^{T_{\land}} (0.11 0.77 0.42) = [\begin{matrix} 0.11 & 0.20 & 0.20 \\ 0.11 & 0.54 & 0.42 \\ 0.11 & 0.24 & 0.24 \end{matrix}]

Then, the fuzzy relational matrices $V_{1} (t_{1}), V_{2} (t_{2}), a n d V_{3} (t_{3})$ between antecedent and consequent of the fuzzy production rule $R_{1}$ can be obtained, shown as follows:

V_{1} (t_{1}) = [\begin{matrix} 0.11 & 0.20 & 0.20 \\ 0.11 & 0.31 & 0.31 \\ 0.11 & 0.24 & 0.24 \end{matrix}] \in A \times B \times b_{1}

V_{2} (t_{2}) = [\begin{matrix} 0.11 & 0.20 & 0.20 \\ 0.11 & 0.54 & 0.42 \\ 0.11 & 0.24 & 0.24 \end{matrix}] \in A \times B \times b_{2}

V_{3} (t_{3}) = [\begin{matrix} 0.11 & 0.17 & 0.17 \\ 0.11 & 0.17 & 0.17 \\ 0.11 & 0.17 & 0.17 \end{matrix}] \in A \times B \times b_{3}

The most widely used fuzzy reasoning method is the max–min composition inference (Yang et al. Citation2019). Assume that the input fuzzy sets ${A_{1}}^{'}$ and ${A_{2}}^{'}$ are shown as follows:

{A_{1}}^{'} = \frac{0.09}{a_{11}} + \frac{0.85}{a_{12}} + \frac{0.29}{a_{13}}

{A_{2}}^{'} = \frac{0.29}{a_{21}} + \frac{0.89}{a_{22}} + \frac{0.45}{a_{23}}

Then, we can get

{A_{1}}^{'} \circ V_{1} (t_{1}) = (0.09 0.85 0.29) \circ V_{1} (t_{1}) = (0.11 0.31 0.31)

{A_{1}}^{'} \circ V_{2} (t_{1}) = (0.09 0.85 0.29) \circ V_{2} (t_{1}) = (0.11 0.54 0.42)

{A_{1}}^{'} \circ V_{3} (t_{1}) = (0.09 0.85 0.29) \circ V_{3} (t_{1}) = (0.11 0.17 0.17)

Finally, we can obtain

B^{'} = (0.29 0.89 0.45) \circ [\begin{matrix} 0.11 & 0.11 & 0.11 \\ 0.31 & 0.54 & 0.17 \\ 0.31 & 0.42 & 0.17 \end{matrix}]

$= (0.31 0.54 0.17)$

= \frac{0.31}{b_{1}} + \frac{0.54}{b_{2}} + \frac{0.17}{b_{3}}

The above description is the fuzzy reasoning process of HLFPN model.

Fuzzy Reasoning Algorithm

In this sub-section, a fuzzy reasoning algorithm (FRA) (Chiang et al. Citation2020) is briefly reviewed to determine whether there exists a fuzzy relational matrix between antecedent and consequent of a fuzzy production rule or not.

INPUT: Input Place (IP), $M e m (p_{i}) \forall p_{i} \in I P$ , (or fuzzy set), where $I P$ denotes a set of input places.

OUTPUT: Output Place (OP), $M e m (p_{i}) \forall p_{i} \in O P$ , (or fuzzy set), where $O P$ denotes a set of output places.

Procedure

Step 1:

Initially, assume that only the Degree of Memberships (DOMs) in the propositions operating on input variables are available. Consequently, the initial marking function is shown as follows:

M (p_{i}) = 0, i f p_{i} \notin I P

M (p_{i}) = t h e n u m b e r o f d a t a t o k e n s, i f p_{i} \in I P

Step 2:

Calculate the fuzzy relational matrices V( $t_{i}$ ) of the current transition, and use the input to perform cylindrical extension.

$\forall t_{j} \in T,$ compute $V (t_{j}) = W_{a} \times W_{c} = (w_{a 1}, w_{a 2}, \dots, w_{a m})^{T} \land (w_{c 1}, w_{c 2}, \dots, w_{c n}),$ where $T$ denotes a set of transitions; $V (t_{j})$ is a fuzzy relational matrix between the antecedent and the consequent of rule $t_{j}$ ; $W_{a} = (w_{a 1}, w_{a 2}, \dots, w_{a m})$ is a fuzzy set of weights in the antecedent; $W_{c} = (w_{c 1}, w_{c 2}, \dots, w_{c n})$ is a fuzzy set of weights in the consequent; and each element of a fuzzy set is denoted as a fuzzy weight interval.

Step 3:

Input a data pattern to $W_{a}$ -input.

Step 4:

Fire the enabled transitions and execute Zadeh’s max-min operation. Let $t_{j}$ be any enabled transition. Then, compute $t_{j} \in T / \forall p_{k}, M (p_{k})$ = the number of data tokens

$W_{a}^{'}$ = $W_{a}$ -input

W_{c}^{'} = W_{a}^{'} *_{\circ} V (t_{j}) o r \neg W_{a}^{'} *_{\circ} V (t_{j})

if an ELSE part is available.

Step 5:

Send the results to the output place. For every output variable $O$ , its associated membership distribution is ${W_{c}}^{'} = \{{w_{c i}}^{'}\} = \lor {w_{c i}}^{'}, i = 1, 2, \dots, l$ , where $l$ denotes the in-degree of output variable $O$ . Then, $W_{c}^{'}$ becomes an actual output.

Step 6

Go back to Step 4, while $\exists t_{j} \in T / M (p_{i}) = 1, \forall p_{i} \in I P (t_{j})$ (That is, while the enabled transitions still exist, go to Step 4).

Step 7

The weighted average defuzzification method is applied, and the real operation value is obtained.

Ieee 1599

“Music is much more than listening to audio encoded in some unreadable binary format” (Wick, Hartelt, and Puppe Citation2020). So, we need to integrate each karaoke song, score, information, and musical phrase. The standard is named as IEEE1599 (Baggi Citation2015), proposed by Denis L. Baggi and Goffredo M. Haus in 2009. IEEE 1599 is a standard to encode music with XML symbols. The main distinguished characteristics use blocks to represent music and the concept of layers, as shown in .

Figure 7. All layers in IEEE 1599 (Wick, Hartelt, and Puppe Citation2020).

General Layer: The layer provides a general description of the music. It contains the following items: music title, authors, number, data, type, and so on. Structural Layer: The layer stores information of musical structure. Logical Layer: The layer provides music description from a symbolic point and includes scores with symbols, as well as the spine with logically organized symbols, and a sorted list of music events. Notational Layer: The layer includes notations of music, like a great variety of pictures. Performance Layer: The layer contains the information about musical performance, including midi, Csound, etc. Audio Layer: The audio layers including media contents are still available in their original encoding.

Related Works

According to reference (Meier Citation2013; Wu and Cao Citation2018), for the problem of song sentiment classification based on Chinese lyrics, there are two methods that have been widely used at present, namely, a sentiment-dictionary-based method and a machine-learning-based method. The former sentiment dictionary is difficult to expand, while the latter may cause “dimensional disaster” when generating feature vectors. It proposes a song sentiment classification method that combines the advantages of these two ideas. The method obtains the positive and negative tendency values by the process of word segmentation and weight calculation based on sentiment dictionary, which are used as feature vectors. Then, we use Logistic Regression algorithm to carry binary classification of Chinese songs. According to reference (Shen and Chen Citation2014), it was the initial idea to develop this lyrics calibration system, which had lower precision value (71.43%) and no recall values available. More useful parameters and detailed discussions are needed to improve its system performance. The fuzzy reasoning algorithm also needs to be further improved. According to reference (Bilbao et al. Citation2020; Ruangsang and Assawinchaichote Citation2019), as the computational costs for physical modeling synthesis of sound are often much greater than those for conventional sound synthesis methods, most techniques currently rely on simplifying assumptions. They include digital waveguides and modal synthesis methods. However, it can be difficult to approach some of the more detailed behavior of musical instruments.

Proposed Approach

For a karaoke system, it is important to make musical phrases, which are concepts and practices related to grouping consecutive melodic notes in performance and composition. A musical phrase is a unit that has a complete musical sense of its own length, in which a singer or instrumentalist can sing or play in one breath. We have developed a C# program to implement GTTM and HLFPN model to segment musical phrases. Musical phrases are used to automatically calibrate the lyrics that form a database system. And the music notes are automatically calibrated for a Chinese character, which has only one syllable.

Data Preprocessing System

A phrase segmentation technique is highly required because a phrase is the basic unit for organizing and analyzing the music signals. In music scores, it is usually marking a phrase with a slur, as shown in .

Figure 8. A phrase.

For phrase segmentation, as rule 3 in grouping preference, “Consider a sequence of four notes, n₁ – n₄, the transition n₂ – n₃ may be heard as a group boundary if marked by register, dynamics and length.” (Haus and Ludovico Citation2017; Haus, Longari Citation2019; Hu, Chen, and Yin Citation2020), the score function (Liu Citation2010) is defined for the grouping preference rule 3 of GTTM as follows:

The register change rule is assumed that the pitches (in cent) of the four music notes n₁n₂n₃n₄ are c₁c₂c₃c₄. If |c₃-c₂| > |c₂-c₁| and |c₃-c₂| > |c₄-c₃|, then it is defined as:

(1)

f_{r e g i s t e r} = \{1 - \frac{|c_{2} - c_{1}| + |c_{4} - c_{3}|}{2 |c_{3} - c_{2}|}\}

(1)

else, defined as $f_{r e g i s t e r} = 0$ . It represents that the pitch has not been changed.

Similarly, the dynamics change rule is assumed that the intensities (in dB) of the four music notes n₁n₂n₃n₄ are d₁d₂d₃d₄. If |d₃-d₂| > |d₂-d₁| and |d₃-d₂| > |d₄-d₃|, then it is defined as

(2)

f_{d y n a m i c s} = \{1 - \frac{|d_{2} - d_{1}| + |d_{4} - d_{3}|}{2 |d_{3} - d_{2}|}\}

(2)

else, defined as $f_{d y n a m i c s} = 0$ . It represents the intensity not being changed.

Also, the length change rule is assumed that the lengths (in second) of the four music notes n₁n₂n₃n₄ are l₁l₂l₃l₄ defined as:

(3)

f_{l e n g t h} = \{1 - \frac{M i n (l_{2}, l_{3})}{M a x (l_{2}, l_{3})}\}

(3)

First, the membership function can be defined in accordance with the input parameters. The fuzzy reasoning rule and the corresponding HLFPN model can be established. The $f_{r e g i s t e r}$ , $f_{d y n a m i c s}$ and $f_{l e n g t h}$ associated with the membership functions for the fuzzifier are input into the HLFPN model. With the characters of fuzzy reasoning, each input parameter is reasoned by the fuzzy rule. Finally, all indices are integrated to make decision on the musical phrase segmentation and to perform automatic calibration of karaoke lyrics (Bastanfard, Amirkhani, and Naderi Citation2020; Chen Citation2021).

HLFPN Model Based on GTTM

To make more precise in the musical phrase segmentation, it is a good way to implement grouping preference rules of GTTM. The decision based on input parameters uses the calculated values to set the position of segmentation. However, an input parameter needs to be considered as a fuzzy term, high or low. Grouping preference rules of GTTM including three parameters are used to determine where the position of performance or segmentation is. In these unclear, obscure, and ambiguous cases, it is appropriate to use the fuzzy logic theory. Thus, the HLFPN model is applied to make decision (Shen et al. Citation2017), as shown in .

Figure 9. Block diagram of HLFPN model.

Calibration Mechanism in Membership Degrees

According to the corresponding membership functions and fuzzy sets defined, we input each parameter to a fuzzifier and calculate the membership degrees in each set. Based on the size of the input parameter, each parameter is transformed to 「If … then … 」statement and the fuzzy rules are established. In the decision method, three parameters are used, namely, the $f_{r e g i s t e r}$ , $f_{d y n a m i c s}$ and $f_{l e n g t h}$ . As a result, each index is defined in membership degrees of Low and High. Then, three sets of technical indices include the S-Type and Z-Type membership degrees, as shown in .

Figure 10. The type of membership degrees for input parameter.

In addition, the decision is also divided into two cases, namely, “Performance” and “Segmentation.” Therefore, two sets of performance and segmentation belong to the Λ-Type of membership degree, as shown in .

Figure 11. The type of membership degrees for technical decision.

In this paper, three input parameters are used, namely, the $f_{r e g i s t e r}$ , $f_{d y n a m i c s}$ and $f_{l e n g t h}$ . As a result, the membership functions of “high” and “low” are defined for each input parameter. Therefore, all of them are defined in . The membership degrees of the decision on music performance and segmentation are listed in .

Table 1. Membership degrees of input parameter.

Display Table

Table 2. Membership degrees of technical decision.

Download CSV Display Table

In the analysis, the values of each input parameter are usually set between 0 and 1. When $f_{r e g i s t e r}$ , $f_{d y n a m i c s}$ , and $_{l e n g t h}$ values larger than 0.3 are high, and those values less than 0.7 are low. However, due to the membership degree as defined above, whose scope of a variable is set between 0 and 1. Thus, the values of each input parameter must be converted to the values between 0 and 1 before they are substituted into the calculation of basic rules.

Assume that $ν_{H}$ and $ν_{L}$ values represent membership degrees of “high” and “low,” respectively. The fuzzification procedure is defined as follows:

(4)

v_{H} (x) = {\begin{matrix} 1, x \geq 0.8 \\ 2 (x - 0.4), 0.3 < x < 0.8 \\ 0, x \leq 0.3 \end{matrix}

(4)

(5)

v_{L} (x) = {\begin{matrix} 0, x \leq 0.2 \\ 2 (x - 0.1), 0.7 > x > 0.2 \\ 1, x \geq 0.7 \end{matrix}

(5)

(6)

v_{S e g m e n t a t i o n} (x) = \{\begin{matrix} 0, x \geq 0.7 \\ \begin{matrix} - \frac{1}{0.3} (x - 1.0), 0.5 \leq x < 0.7 \\ \frac{1}{0.3} (x - 0.4), 0.3 < x < 0.5 \end{matrix} \\ 0, x \leq 0.3 \end{matrix}

(6)

(7)

v_{P e r f o r m a n c e} (x) = \{\begin{matrix} 0, x \geq 0.4 \\ \begin{matrix} - 5 (x - 0.5), 0.2 \leq x < 0.4 \\ 5 (x - 0.0), 0.0 < x < 0.2 \end{matrix} \\ 0, x \leq 0.0 \end{matrix}

(7)

Fuzzy Reasoning and Building HLFPN Model

According to fuzzy sets and their corresponding membership degrees as defined previously, each input parameter is imported to a fuzzifier and the technical index is calculated to find its own membership degree. Based on the size of an input parameter, it has been transformed into an「If … then … 」statement in order to establish a rule base. The activity diagram of fuzzy reasoning is shown in .

Figure 12. The activity diagram of fuzzy reasoning.

Assume that $f_{r e g i s t e r}$ , $f_{d y n a m i c s}$ , and $f_{l e n g t h}$ denote input linguistic variables with fuzzy terms: high (H) and low (L). And assume that the decision (D) is an output linguistic variable with fuzzy terms: strong (S) and weak (W). The fuzzy production rules are defined as follows:

R1: If $f_{r e g i s t e r}$ is H, then D is S.

R2: If $f_{r e g i s t e r}$ is L, then D is W.

R3: If $f_{d y n a m i c s}$ is H, then D is S.

R4: If $f_{d y n a m c}$ is L, then D is W.

R5: If $f_{l e n g t h}$ is H, then D is S.

R6: If $f_{l e n g t h}$ is L, then D is W.

Then, the fuzzy production rules are transformed into the HLFPN model, as shown in , and the parameters are listed in .

Figure 13. The HLFPN model representing six fuzzy production rules.

Table 3. Description of parameters.

Display Table

According to the proposed HLFPN model, the fuzzy inference is performed with the technical indices of a fuzzier and the fuzzy rules in the rule base. Within the process of fuzzy inference, the standard operators for calculations are used. Finally, the crisp outputs are obtained to decide performance or segmentation by the weighted average defuzzification method, and the real operation value of “Decision” can be computed.

Preprocessing Results

However, the music lyrics, score, and other music information are integrated into an automatic calibration system. A new music standard named IEEE 1599 and a software system are used to show the preprocessing results. The IEEE 1599 standard (Baggi Citation2015) is used to encode music with XML symbols. This is an example in IEEE 1599 from a song named “Silence,” composed by Jay Chou, as shown in .

Figure 14. Layers in IEEE 1599.

Figure 15. Description in general layer of IEEE 1599.

Figure 16. The related files in general layer of IEEE 1599.

Figure 17. Layers in IEEE 1599.

Experimental Results

To demonstrate that the proposed HLFPN-based calibration system is feasible, this Section is intended to evaluate the proposed system performance. A fuzzy reasoning algorithm (FRA) (Chiang et al. Citation2020) is used to determine whether a fuzzy relational matrix exists between antecedent and consequent of a fuzzy rule. Then, our proposed approach is applied to the illustrative example.

Example of HLFPN Model for Fuzzy Reasoning

In this sub-section, an example is used to illustrate the viability of the fuzzy reasoning process. Two digits of significant numbers represent the result of each calculation.

Step 1: Initially, assume that only the DOMs in the propositions operating input variables are available. Assume that four fuzzy sets are shown as follows:

$H = \frac{0.00}{h_{L}} + \frac{0.70}{h_{H}}$ , $L = \frac{0.30}{l_{L}} + \frac{0.00}{l_{H}}$ , $S = \frac{0.00}{s_{L}} + \frac{0.70}{s_{H}}$ , $W = \frac{0.30}{w_{L}} + \frac{0.00}{w_{H}}$

Step 2: Compute the fuzzy relational matrices, shown as follows:

V (t_{1}) = H * S = (0.00 0.70)^{T} \land (0.00 0.70) = [\begin{matrix} 0.00 & 0.00 \\ 0.00 & 0.70 \end{matrix}]

V (t_{3}) = V (t_{1}) = [\begin{matrix} 0.00 & 0.00 \\ 0.00 & 0.70 \end{matrix}]

V (t_{5}) = V (t_{1}) = [\begin{matrix} 0.00 & 0.00 \\ 0.00 & 0.70 \end{matrix}]

V (t_{2}) = L * W = {(0.30 0.00)}^{T} \land (0.30 0.00) = [\begin{matrix} 0.30 & 0.00 \\ 0.00 & 0.00 \end{matrix}]

V (t_{4}) = V (t_{2}) = [\begin{matrix} 0.30 & 0.00 \\ 0.00 & 0.00 \end{matrix}]

V (t_{6}) = V (t_{2}) = [\begin{matrix} 0.30 & 0.00 \\ 0.00 & 0.00 \end{matrix}]

Step 3: Input data pattern, shown as follows:

H_{f r e g i s t e r}^{'} = \frac{0.00}{h_{L}^{'}} + \frac{0.20}{h_{H}^{'}}

L_{f r e g i s t e r}^{'} = \frac{0.80}{l_{L}^{'}} + \frac{0.00}{l_{H}^{'}}

H_{f d y m a n i c s}^{'} = \frac{0.00}{h_{L}^{'}} + \frac{0.80}{h_{H}^{'}}

L_{f d y m a n i c s}^{'} = \frac{0.00}{l_{L}^{'}} + \frac{0.00}{l_{H}^{'}}

H_{f l e n g t h}^{'} = \frac{0.00}{h_{L}^{'}} + \frac{0.40}{h_{H}^{'}}

L_{f l e n g t h}^{'} = \frac{0.20}{l_{L}^{'}} + \frac{0.00}{l_{H}^{'}}

Step 4: Fire the enabled transitions:

S_{f r e g i s t e r}^{'} = H_{f r e g i s t e r}^{'} \circ V (t_{1}) = [\begin{matrix} 0.00 & 0.20 \end{matrix}] \circ [\begin{matrix} 0.00 & 0.00 \\ 0.00 & 0.70 \end{matrix}] = [\begin{matrix} 0.00 & 0.20 \end{matrix}]

W_{f r e g i s t e r}^{'} = L_{f r e g i s t e r}^{'} \circ V (t_{2}) = [\begin{matrix} 0.80 & 0.00 \end{matrix}] \circ [\begin{matrix} 0.30 & 0.00 \\ 0.00 & 0.00 \end{matrix}] = [\begin{matrix} 0.30 & 0.00 \end{matrix}]

S_{f d y n a m i c s}^{'} = H_{f d y n a m i c s}^{'} \circ V (t_{3}) = [\begin{matrix} 0.00 & 0.80 \end{matrix}] \circ [\begin{matrix} 0.00 & 0.00 \\ 0.00 & 0.70 \end{matrix}] = [\begin{matrix} 0.00 & 0.70 \end{matrix}]

W_{f d y n a m i c s}^{'} = L_{f d y n a m i c s}^{'} \circ V (t_{4}) = [\begin{matrix} 0.00 & 0.00 \end{matrix}] \circ [\begin{matrix} 0.30 & 0.00 \\ 0.00 & 0.00 \end{matrix}] = [\begin{matrix} 0.00 & 0.00 \end{matrix}]

S_{f l e n g t h}^{'} = H_{f l e n g t h}^{'} \circ V (t_{5}) = [\begin{matrix} 0.00 & 0.40 \end{matrix}] \circ [\begin{matrix} 0.00 & 0.00 \\ 0.00 & 0.70 \end{matrix}] = [\begin{matrix} 0.00 & 0.40 \end{matrix}]

W_{f l e n g t h}^{'} = L_{f l e n g t h}^{'} \circ V (t_{6}) = [\begin{matrix} 0.20 & 0.00 \end{matrix}] \circ [\begin{matrix} 0.30 & 0.00 \\ 0.00 & 0.00 \end{matrix}] = [\begin{matrix} 0.20 & 0.00 \end{matrix}]

Steps 5–6: Finally, the fuzzy reasoning result is shown as follows:

D = {S_{f r e g i s t e r}^{'}}^{} \cup {W_{f r e g i s t e r}^{'}}^{} \cup S_{f d y n a m i c s}^{'} \cup W_{f d y n a m i c s}^{'} \cup S_{f l e n g t h}^{'} \cup W_{f l e n g t h}^{'}

= \frac{0.30}{d_{L}} + \frac{0.70}{d_{H}}

Step 7: If the weighted average defuzzification method is applied, then the real operation value of “Decision” can be computed, shown as follows:

d e c i s i o n = \frac{0.30 \times d_{L} + 0.70 \times d_{H}}{0.30 + 0.70} = \frac{0.30 \times 0.30 + 0.70 \times 0.70}{0.30 + 0.70} = 0.58

Due to $0.3 \leq d e c i s i o n < 0.7$ , the decision is made as “Segmentation.”

Main Results

The experimental results illustrate the effectiveness of the proposed system. An automatic calibration system for karaoke lyrics is useful, but its musical phrase segmentation may have a little deviation. The comparison results are shown in .

Figure 18. Deviation in automated system.

Figure 19. The actual score.

According to manual experiments, the automatic music segmentations which are measured by the recall and the precision values defined as follows:

(11)

r e c a l l = \{\frac{n u m b e r o f p h r a s e s c o r r e c t l y det e c t e d}{n u m b e r o f p h r a s e s}\} \times 100 %

(11)

(12)

p r e c i s i o n = \{\frac{n u m b e r o f p h r a s e s c o r r e c t l y det e c t e d}{n u m b e r o f p h r a s e s det e c t e d}\} \times 100 %

(12)

Finally, in the experiment, 50 Chinese pop songs were used. In total, there are 2076 musical phrases manually labeled. As shown in , there are 1463 musical segmentations that are correctly detected by the proposed system. Therefore, the average recall is 90.46% and the average precision is 92.78%.

Table 4. Experimental results.

Download CSV Display Table

Functional Comparison

Based on the above experimental results, we have made a functional comparison with each other among different approaches, including ours, Wu’s (Wu and Cao Citation2018), and Bilbao’s (S. Bilbao et al. Citation2020). The results of functional comparison are listed in . In summary, our proposed approach is more feasible and powerful than others.

Table 5. Results of functional comparison.

Download CSV Display Table

“ˇ” denotes “Yes” or “Available.”

Conclusion

In this paper, those datasets including karaoke lyrics, songs, titles, and author names downloaded from the Internet have been used, and a database system of Chinese pop songs for automatic calibration has been built. The grouping preference rules of GTTM are adopted to perform the parameter analysis as a basis for determination. The appropriate decisions are made based on the above parameters by applying the reasoning algorithm of HLFPN model. A C# program has been developed to implement this automatic calibration system. Users can use this system to sing a pop song from which the singer’s voices are removed, and the lyrics can be calibrated automatically. The contributions of this paper are stated as follows:

A robust database system with Chinese pop songs has been successfully built, providing a useful environment for musical analysis.
The grouping preference rules of GTTM are easily used to group each song.
The parameters obtained from the register change rule, the dynamics change rule, and the length change rule are all used to easily conduct technical analysis for musical decision.
Through the fuzzy reasoning algorithm of HLFPN model, a fast decision can be made on music performance or segmentation.
This system can provide users with an easy way to sing a pop song and to get the lyrics automatically.

In the future, the learning algorithms of the HLPFN model will be adopted to accurately detect music segmentations. Also, a specific tool of HLFPN model will be created to predict the music segmentation and to make a difference between HLFPN model and GTTM. Since this system is now designed for Chinese karaoke songs only, it has been limited to some users. Therefore, it will be enhanced for English songs or songs in other languages to provide more applications.

Acknowledgments

The authors are grateful to the anonymous reviewers for their constructive comments, which have improved the quality of this paper.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by the Ministry of Science and Technology, Taiwan MOST 107-2221-E-845- 001-MY3, ROC, under grants MOST 110-2637-E-131-005- and MOST 110-2221-E-845-002-.

References

Baggi, D. L. 2015. An IEEE standard for symbolic music. IEEE Computer, 100–3048.
Google Scholar
Bastanfard, A., D. Amirkhani, and S. Naderi. 2020. A signing voice separation method from Persian music based on pitch detection methods, Procs. of 2020 6th Iranian Conference on Signal Processing and Intelligent Systems (ICSPIS), Mashhad, Iran, 1–6.
Google Scholar
Bilbao, S., C. Desvages, M. Ducceschi, B. Hamilton, R. Harrison-Harsley, A. Torin, and C. Webb. 2020. Physical modeling, algorithms, and sound synthesis: The NESS Project. Computer Music Journal 43 (2–3):15–30. doi:10.1162/comj_a_00516.
Web of Science ®Google Scholar
Casey, M. A., R. Veltkamp, M. Goto, M. Leman, C. Rhodes, and M. Slaney. 2018. Content-based music information retrieval: Current directions and future challenges. Proceedings of the IEEE more author. 96 (4):668–96. doi:10.1109/JPROC.2008.916370.
Google Scholar
Chan, A. B., and J. H. Hsiao. 2016. Information distribution within musical segments. Music Perception: An Interdisciplinary Journal 34 (2):218–42. doi:10.1525/mp.2016.34.2.218.
Web of Science ®Google Scholar
Chen, G.-F. 2021. Music sheet score recognition of Chinese Gong-che notation based on deep learning. Proceedings of 2021 International Conference on Big Data Analysis and Computer Science (BDACS), Kunming, China, 1–6.
Google Scholar
Chiang, D.-L., S.-K. Wang, Y.-N. Lin, C.-Y. Yang, V. R. L. Shen, T. T.-Y. Juang, and T.-Y. Liao. 2021. Development and evaluation of a novel investment decision system in cryptocurrency market. Applied Artificial Intelligence 35 (14):1169–95. doi:10.1080/08839514.2021.1975380.
Web of Science ®Google Scholar
Chiang, D.-L., S.-K. Wang, -Y.-Y. Wang, Y.-N. Lin, T.-Y. Hsieh, C.-Y. Yang, V. R. L. Shen, and H.-W. Ho. 2020. Modeling and analysis of Hadoop MapReduce systems for big data using Petri nets. Applied Artificial Intelligence 35 (1):80–104. doi:10.1080/08839514.2020.1842111.
Web of Science ®Google Scholar
Chowdhury, J. I., D. Thornhill, P. Soulatiantork, Y. Hu, N. Balta-Ozkam, L. Varga, and B. K. Nguyen. 2019. Control of supercritical organic rankine cycle based waste heat recovery system using conventional and fuzzy self-tuned PID controllers. International Journal of Control, Automation and Systems 17 (4):2969–81. doi:10.1007/s12555-018-0766-6.
Google Scholar
Frankland, B. W., and A. J. Cohen. 2018. Parsing of melody: Quantification and testing of the local grouping rules of Lerdahl and Jackendoff’s a generative theory of tonal music. Music Perception: An Interdisciplinary Journal 21 (4):499–543. doi:10.1525/mp.2004.21.4.499.
Google Scholar
Goto, M. 2016. A chorus-section detecting method for musical audio signal and its application to a music listening station. IEEE Transactions on Audio, Speech, and Language Processing 14 (5):1783–94. doi:10.1109/TSA.2005.863204.
Google Scholar
Haus, G., and M. Longari. 2019. Time-based music description approach based on XML. Computer Music Journal 29 (1):70–85. doi:10.1162/comj.2005.29.1.70.
Google Scholar
Haus, G., and L. A. Ludovico. 2017. Music segmentation: An XML-oriented approach. Lecture Notes in Computer Science 33 (10–1):330–46.
Google Scholar
Hu, D., Z. Chen, and F. Yin. 2020. Passive geometry calibration for microphone arrays based on distributed damped Newton optimization. IEEE/ACM Transactions on Audio, Speech, and Language Processing 29 (11):118–31. doi:10.1109/TASLP.2020.3037532.
Google Scholar
Lin, Y.-N., T.-Y. Hsieh, C.-Y. Yang, V. R. L. Shen, T. T.-Y. Juang, and W.-H. Chen. 2020a. Deep Petri nets of unsupervised and supervised learning. Measurement and Control Online. 53 (7–8):1–11. doi:10.1177/0020294020923375.
Web of Science ®Google Scholar
Lin, Y.-N., S.-K. Wang, G.-J. Chiou, C.-Y. Yang, V. R. L. Shen, T. T.-Y. Juang, and T.-J. Huang. 2022. Novel deadlock control for smartphone manufacturing systems using Petri nets. International Journal of Control, Automation and Systems 20 (3):877–87. doi:10.1007/s12555-020-0239-6.
Web of Science ®Google Scholar
Lin, Y.-N., S.-K. Wang, C.-Y. Yang, V. R. L. Shen, T. T.-Y. Juang, and C.-S. Wei. 2020b. Novel JavaScript malware detection based on fuzzy Petri nets. Journal of Intelligent & Fuzzy Systems Online:1–26.
Web of Science ®Google Scholar
Liu, -C.-C. 2010. Automatic phrase segmentation of MP3 songs based on the technique of breath sound detection. Proceedings of IEEE 7th International Conference on Information Technology and Applications, Taipei, Taiwan, 285–90.
Google Scholar
Matsumoto, M., and J. Hori. 2019. Classification of silent speech using support vector machine and relevance vector machine. Applied Soft Computing 20 (1):95–102. doi:10.1016/j.asoc.2013.10.023.
Google Scholar
Meier, W. 2013. eXist: An opensource native XML database. Lecture Notes in Computer Science 2593 (1):169–83.
Google Scholar
Ruangsang, S., and W. Assawinchaichote. 2019. Control of nonlinear Markovian jump system with time varying delay via robust H fuzzy state feedback plus state-derivative feedback controller. International Journal of Control, Automation and Systems 17 (3):2414–29. doi:10.1007/s12555-019-0044-2.
Google Scholar
Shen, V. R. L., and H.-C. Chen. 2014. An automatic calibration system for Chinese karaoke lyrics based on high-level fuzzy Petri nets. Procs. of 2014 International Conference on Machine Learning and Cybernetics (ICMLC), Lanzhou, China, 1–6.
Google Scholar
Shen, V. R. L., R.-K. Shen, C.-Y. Yang, and W.-C. Chen. 2017. A novel fall prediction system on smartphones. IEEE Sensors Journal 17 (6):1865–71. doi:10.1109/JSEN.2016.2598524.
Web of Science ®Google Scholar
Sung, W.-T., and K.-Y. Chang. 2019. Health parameter monitoring via a novel wireless system. Applied Soft Computing 22 (1):667–80. doi:10.1016/j.asoc.2014.04.036.
Google Scholar
Tsai, W. H., and H.-C. Lee. 2017. Automatic evaluation of karaoke singing based on pitch, volume, and rhythm features. IEEE Transactions on Audio, Speech, and Language Processing 20 (4):1233–43. doi:10.1109/TASL.2011.2174224.
Google Scholar
Wick, C., A. Hartelt, and F. Puppe. 2020. Lyrics recognition and syllable assignment of medieval music manuscripts, Procs. of 2020 17th. International Conference on Frontiers in Handwriting Recognition (ICFHR), Dortmund, Germany, 1–6.
Google Scholar
Wu, X., and Y. Cao. 2018. Research on song sentiment binary classification based on Chinese lyrics. Proceedings of 2018 IEEE/ACIS 17th International Conference on Computer and Information Science (ICIS), Singapore, 22–29.
Google Scholar
Yang, S.-H. Y.-N. L., G.-J. Chiou, M.-K. Chen, V. R. L. Shen, H.-Y. Tseng, and H.-Y. Tseng. 2019. Novel shot boundary detection in news streams based on fuzzy Petri nets. Applied Artificial Intelligence 33 (12):1035–57. doi:10.1080/08839514.2019.1661118.
Web of Science ®Google Scholar
Zhang, Y., H. Zhao, and C. He. 2022. Robust control design with optimization for uncertain mechanical systems: Fuzzy set theory and cooperative game theory. International Journal of Control, Automation and Systems 20 (2):1377–92. doi:10.1007/s12555-020-0874-y.
Google Scholar

Download PDF

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Your download is now in progress and you may close this window

Did you know that with a free Taylor & Francis Online account you can gain access to the following benefits?

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Have an account?
Login now Don't have an account?
Register for free

Login or register to access this feature

Have an account?
Login now Don't have an account?
Register for free

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Development and Evaluation of an Intelligent System for Calibrating Karaoke Lyrics Based on Fuzzy Petri Nets

ABSTRACT

Introduction

Literature Review

Generative Theory of Tonal Music (GTTM)

High-Level Fuzzy Petri Net

Fuzzy Reasoning

Fuzzy Reasoning Algorithm

Procedure

Step 1:

Step 2:

Step 3:

Step 4:

Step 5:

Step 6

Step 7

Ieee 1599

Related Works

Proposed Approach

Data Preprocessing System

HLFPN Model Based on GTTM

Calibration Mechanism in Membership Degrees

Table 1. Membership degrees of input parameter.

Table 2. Membership degrees of technical decision.

Fuzzy Reasoning and Building HLFPN Model

Table 3. Description of parameters.

Preprocessing Results

Experimental Results

Example of HLFPN Model for Fuzzy Reasoning

Main Results

Table 4. Experimental results.

Functional Comparison

Table 5. Results of functional comparison.

Conclusion

Acknowledgments

Disclosure statement

Additional information

Funding

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date