I have read through a lot of the threads here and have found a good amount of useful input...but there are a couple of questions that remain unanswered.
I am storing questions & answers from a questionnaire in a database.
I have the tables:
Survey (surveyID)
Question (questionID, surveyID, questionType, Question)
Answer (answerID, userID, questionID, answer)
User (userID, username)
Question 1: multi-value questions...I would have a separate row for each value in the answer table....but have the same questionID and userID. But then how would you work the following:
-what are your coping strategies (multi-value)
-how frequently do you use each coping strategy?
i.e. a one-one relationship of coping strategy-frequency.
The solution above (i.e. one row per answer doesn't work because you need the relation between the specific coping strategy and the frequency).
A similar question is for the following:
have you been involved in conflicts over land-use rights?
with whom? (multi-value)
for what reasons?
(i.e. what were your reasons for conflict with the neighbours, what were your reasons for conflict with the authorities?) ...i.e. one to many on a multi-value attribute
Thank you in advance, I hope I have explained my query sufficiently well.
Becky
Related
I'm designing the database for an application in which the user is presented with questions, and he must answer them. Think of it either as a questionnaire or as a quiz game, the concept applies to both. I plan to have:
a table with the questions
a table with the possible answers, each of them linked to the question it belongs to with a foreign key (let's keep things simple and assume it's a 1:many relationship, where answers cannot be shared between questions)
a table with the answers that users provided (with foreign keys to the question, the answer and the user ID)
Since many of the questions will be common cases, like yes/no, I decided I'd specify a "question type" enumeration to each question. If the application sees a yes/no question, for example, it means there are no answers in the database, and the application will automatically add the two answers, "Yes" and "No". This saves me hundreds or thousands of useless rows in the answers table.
However, I'm not sure how I should define the table to record user answers. Without the special types of questions, I'd just record the question ID, the answer ID and the user ID, which means "user X answered Y to question Z". However, "yes/no" questions would not have a matching answer in the table, so I can't use the answer ID.
Even making the answers shareable between questions (by making a many-to-many relationship between questions and answers) is not a good solution. Sure, it would allow me to define "Yes" and "No" as regular answers, but then applications should be aware that a "yes/no" question uses answers (say) 7 and 8 - or, when creating a "yes/no" question answers 7 and 8 should be bound to that question. But this means that these "special" answers' IDs must be hardcoded somewhere else. Also, this would not scale well should I add more special types of question in the future.
How should I proceed? Ideally, I need to store in each row of my "user answers" table either a fixed value or a foreign key to the answers table. Is there a better solution than using two columns, one of which is NULL?
I'm using SQL Server, if that matters.
Based on your description I think I'd go on the route of adding another column to the table and making the FK column nullable.
You'd probably have only a few choices for those special questions, so a nullable TINYINT datatype would cut it, and it is only 1 extra byte for your answer row. If this extra column happen to raise the number of columns to more than a multiple of eight, say you go from 8 to 9 or 16 to 17, than you pay another extra byte for the growth of the null bitmap. But it's 2 extra bytes per row worst case.
My data consists of questions and answers. It also shows what the versionNr is of the answer and shows which persons changed which Answer. Its possible that some questions have the same answer.
questionID Question Answer VersionNr User date
1 Who is....? W.H. Smith 1.0 ...#test.com 1/1/14
1.1 ...#test.com 3/8/14
2 What is...? 3% 1.0 ...#test.com 1/2/14
RG = Repeating Group
Bold = Composite/Primary Key
0NF:
(questionID, question, AnswerID, answer,RG{versionNr, user, date}
1NF:
(questionID, question, AnswerID, answer)
(questionID, AnswerID, VersionNr, user, date)
2NF/3NF:
Q(questionID, question, AnswerID)
Ans(AnswerID, answer)
Version(questionID, AnswerID, VersionNr, user, date)
My question is whether I should remove questionID from Version, because the versionNr, date and User gives information about the Answer and Not the question.
If there can be many version records for one answer record, then yes, QuestionID should not be in Version, as it is redundant data: You can always tell the QuestionID of a Version by looking up the Answer record.
That said, it's not clear to me what you're trying to model. When you say that a user can change an answer, do you mean that this is a quiz and users can change what the "right answer" is? Or do you mean that they can change the answer they gave to a quiz or survey?
But if there are many Version records for one Answer, than you are not recording what the answer was at each step, you are just recording who made the change. Maybe that's all you need to know for your purposes. But if the idea is that you want to record what the answer was for each version, then Answer and Version should be a combined into a single table. In that case you need questionID in the Answer/Version record because otherwise you have no way to know which question any answer other than the latest is for.
If the idea is that there is just one Answer record for each Question, and then many versions, then Question and Answer should be combined into a single record. Well, you say that several questions can share the same answer. As I asked in a comment, what does this mean? Are you talking about answers that are the same by coincidence, like on question is "How much is 2+2?" and another is "What is the square root of 16?" and both answers happen to be 4? Or do you mean that they are really the same answer in the sense that if the answer to one question changed, the answer to the other would logically and inevitably have to change to the same thing? Like, "Who is the current vice president of the United States?" and "If the president of the United States died, who would become president?"
I'd think the logical schema would be this:
Question (questionID, question_text)
Answer (answerID, questionID, version_number, answer_text, user, change_date)
Then the current answer is
select answer_text
from answer
where questionID=#qid
and version_number=(select max(version_number) from answer
where questionID=#qid)
This requires no redundant data.
An obvious denormalization for performance and simplicity would be to put the answerID of the current answer in the Question record.
If I understand your problem correctly, I believe you can do the following to achieve a working 3NF schema.
Q(questionID, question)
Ans(AnswerID,answer,QuestionID)
Version(VersionNr,AnswerID,User,date)
(Italics are foreign keys)
So, the primary key of version is triple {VersionNr,AnswerID,User). Please explain if something is not correct with my solution.
So, to sum up, no, you don't need to have questionID inside version, since you can find it with joins.
UPDATE
I think I understand your problem and I believe the correct solution is the following.
Q(questionID,question)
Ans(AnswerID,answer)
Version(QuestionID,AnswerID,VersionNr,user,date)
Actually, your relationship between answers and question is many to many, since many answers are connected to many questions. So, you can use Version as the intermediate table to construct this many to many relationship.
Additionally, you can add the version number, the user and the date in that intermediate table, to have all the necessary information.
I need to store in a DB (MSSQL) multiple answer questions, and I am having trouble deciding what is the best way to do so. Should I store questions in one table, all answers in another and right answers in a 3rd table? Should they all be in the same table?
I would love to hear your ideas
thanks
Amit
Each concept, or entity, should be in its own table.
Questions in a Question table
Answers in an Answer Table with a QuestionID with a boolean field indicating its right or wrongness
(unless Answers can belong to more than one question?)
To me in your case best way is to hold question and answer in seperate tables. If your question has multiple answer and you maybe want to use one answer in a lot of question its the best way. You should also add table for which answer is good for specific question.
In this way you don't have data redundancy in database.
I agree with podiluska.
Each entity in a table.
If all or any of answer can be "typified" (i.e. not "free answer"), add a table for "typified answers" relating this one with table questions (to prevent someone to choose inapplicable "typified answers".
In table "user answers" relate it with table "questions" and if some record (or answer) contains a not "typified" you can mark it with a boolean column indicating "right" or "wrong".
I hope have helped you.
Wondering what is the best way to model out a questionnaire in terms of saving the response. So I would assume the ideal way to save it would be a table like this:
ID_pk
response_id
question_id
value
So each entry would be an answer to a question. The question is when I search on this table the performance would be bad. I was wondering do I need to create another table on the side to help me with performance. Any ideas?
Thanks
The best approach depends on how flexible you need this to be. Will you have multiple choice questions? Numeric-only questinos used for calculaitons? True/False questions, etc.
The approach that worked for us to enable ultimate flexibility has this structure:
Table: Surveys
SurveyId PK
Surveyname
Table QuestionTypes
QuestionType PK
QuestionTypeText (ie "Multiple Choice", "True/False", etc.)
Table Questions
SurveyId FK
QuestionId PK
QuestionText
QuestionType
Table Answers (Contains answers ONLY for multiple choice questions)
Answerid PK
QuestionId FK
AnswerText
Table Results
ResultId PK
Surveyid FK
QuestionId FK
QuestionType (copied in from Question)
MultipleChoiceAnswerid (null unless it was a multiple choce question, FK to Answer otherwise)
BooleanResult (if QT = boolean, null otherwise)
NumericResult
FreeFormTextResult
and then a lot of programming logic to handle the display.
Actually, our real soultion goes a stp further and groups questions into QuestionGroups so that similar questions can be kept together, but this simplified version should give you some ideas. It's relatively complex, but I've seen professional survey software that's even more so, but follows a similar pattern. it's also worked out very well for us. We've been able to reuse it for several scenarios by creating a new SurveyType.
The idea is that anything that can be duplicated in a table should be pulled out to another table.
Question would be duplicated, so you'd need a questions table
Response could be duplicated, depends on if this would be multiple choice or fill in the blank, etc.
Value could be duplicated, but depends on the answer from Responses
So, I'd say to think about your questionnaire. Think about what would be saved multiple times in the db and then normalize from that.
Don't worry about performance - normalization will help more than it will hurt.
I am designing a simple database of online exam system. But I can not figure out how the questions and the answers should be stored. I am thinking question and Answer as different entities. There will be both MCQ and short questions in the same question set and the number of questions in a set may be dynamic (choose by teacher).
As I see you're looking for something like this:
User table - everyone who will be answering to the questions. It will have UserId and other profile information - name, class, photo, etc.
Question table - it will have questionid, created by (userid) and text of the question
AnswerOption - it will have optionid, link to question, text of answer option
UserAnswer - it will have useranswerid, questionid, optionid
So for example you have this question: "How much is 2*2?" and answer options are "4", "5", "6".
In this case you will have 1 record in question table and 3 records in AnswerOption table.
Now when someone answers the question, you insert a record into UserAnswer table with respective userid, questionid and optionid.
Is this what you had been looking for?
And of course you should also think how to group questions in test etc.
Since you've got multi-choice questions (MCQ, I assume), you need to consider carefully whether the alternatives in an MCQ are part of the question or are answers with a status (wrong, part of the right answer, correct). If a question has multiple answers, keep them in separate tables. If a question has just one answer, then keep them in a single table.
Start from your smallest item. A question can have multiple answer choices, one of which is good. So you could have an answer table.
ANSWER:
AnswerID
QuestionID
Choice
Text
Good (boolean qualifier)
QUESTION:
QuestionID
Text
Points
This is just a suggestion. It all depends on what you want to do. But first, you lay out by categories what your items are. Prefer loose leaf paper.