Q-matrix is a matrix describing relations of questions and concepts required for their understanding. It is a domain-independent model of knowledge represented by a binary matrix showing the relationship between test items and latent or underlying attributes, or concepts.
Q-matrix is a MxN matrix, where M equals the number of questions in an assessment, and N equals the total number of concepts required for understanding all questions. The matrix element A[i,j] equals 1 if the i-th concept is required for correctly answering j-th question and 0 if the i-th concept is NOT required for correctly answering j-th question. Alternatively, matrix values can be not just {0,1}, but real numbers from the interval [0,1], describing the probability that a student who knows i-th concept will correctly answer j-th question.
From the matrix, one can read that knowledge of concept 1 is required in order to answer correctly questions 3, 4 and 5. One can also read that questions 1 and 2 test only the knowledge of concept 2.
Furthermore, one could also say that the ideal response of a student taking the test formed of those 5 questions who knows only concept 1 should be “00001”. This is so since he does not know concept 2 which is required for questions 1 and 2 (therefore the leading “00”). Yet this concept is also required for correct ansqering of questions 3 and 4 so he can not answer those questions neither (therefore the following “00”). Finally, 5th question requires only knowledge about the concept 1 so the student can answer this question correctly (therefore the ending “1”, forming all together “00001”).
Q-matrix can be used for understanding students' performance. Due to various knowledge and assessment characteristics, students' responses rarely match ideal responses generated from the matrix. Still, by assigning the closest ideal response to a student's response vector, it can be assumed which concepts the student does, and which he does not know. This information can be used in order to direct him in further learning.
The goal of q-matrix construction is to extract underlying, or latent, variables, which account for students' differential performance on questions. A q-matrix can be created in two ways:
Factor analysis can be considered as an alternative to q-matrices. Concepts are in that case automatically determined using covariance matrix. Number of concepts should be smaller than number of questions. Still, this methos has proven to be less fault tollerant. Still, when forming a correlation matrix, we lose individual student data in favor of calculating average relationships between questions. The q-matrix method is optimized to assign each student the most appropriate knowledge state, using all available response data for each student.
As Sellers found in her research, the results obtained through q-matrix analysis seem to describe relationships among variables in interpretable ways. Factor analysis and principal components analysis, on the other hand, do not readily offer interpretable results.
Later researchers found that, although the q-matrix model was a good way to compare student data to a concept model, expert-constructed q-matrices did not correspond to student data any better than random q-matrices did.
The findings in Brewer's previous research, which found that the factor analysis method performed poorly in comparison with the q-matrix method when fewer observations were available.