student.txt 3.1 KB

1234567891011121314151617181920212223242526272829303132333435363738394041
  1. # Attributes for both student-mat.csv (Math course) and student-por.csv (Portuguese language course) datasets:
  2. 1 school - student's school (binary: "GP" - Gabriel Pereira or "MS" - Mousinho da Silveira)
  3. 2 sex - student's sex (binary: "F" - female or "M" - male)
  4. 3 age - student's age (numeric: from 15 to 22)
  5. 4 address - student's home address type (binary: "U" - urban or "R" - rural)
  6. 5 famsize - family size (binary: "LE3" - less or equal to 3 or "GT3" - greater than 3)
  7. 6 Pstatus - parent's cohabitation status (binary: "T" - living together or "A" - apart)
  8. 7 Medu - mother's education (numeric: 0 - none, 1 - primary education (4th grade), 2 – 5th to 9th grade, 3 – secondary education or 4 – higher education)
  9. 8 Fedu - father's education (numeric: 0 - none, 1 - primary education (4th grade), 2 – 5th to 9th grade, 3 – secondary education or 4 – higher education)
  10. 9 Mjob - mother's job (nominal: "teacher", "health" care related, civil "services" (e.g. administrative or police), "at_home" or "other")
  11. 10 Fjob - father's job (nominal: "teacher", "health" care related, civil "services" (e.g. administrative or police), "at_home" or "other")
  12. 11 reason - reason to choose this school (nominal: close to "home", school "reputation", "course" preference or "other")
  13. 12 guardian - student's guardian (nominal: "mother", "father" or "other")
  14. 13 traveltime - home to school travel time (numeric: 1 - <15 min., 2 - 15 to 30 min., 3 - 30 min. to 1 hour, or 4 - >1 hour)
  15. 14 studytime - weekly study time (numeric: 1 - <2 hours, 2 - 2 to 5 hours, 3 - 5 to 10 hours, or 4 - >10 hours)
  16. 15 failures - number of past class failures (numeric: n if 1<=n<3, else 4)
  17. 16 schoolsup - extra educational support (binary: yes or no)
  18. 17 famsup - family educational support (binary: yes or no)
  19. 18 paid - extra paid classes within the course subject (Math or Portuguese) (binary: yes or no)
  20. 19 activities - extra-curricular activities (binary: yes or no)
  21. 20 nursery - attended nursery school (binary: yes or no)
  22. 21 higher - wants to take higher education (binary: yes or no)
  23. 22 internet - Internet access at home (binary: yes or no)
  24. 23 romantic - with a romantic relationship (binary: yes or no)
  25. 24 famrel - quality of family relationships (numeric: from 1 - very bad to 5 - excellent)
  26. 25 freetime - free time after school (numeric: from 1 - very low to 5 - very high)
  27. 26 goout - going out with friends (numeric: from 1 - very low to 5 - very high)
  28. 27 Dalc - workday alcohol consumption (numeric: from 1 - very low to 5 - very high)
  29. 28 Walc - weekend alcohol consumption (numeric: from 1 - very low to 5 - very high)
  30. 29 health - current health status (numeric: from 1 - very bad to 5 - very good)
  31. 30 absences - number of school absences (numeric: from 0 to 93)
  32. # these grades are related with the course subject, Math or Portuguese:
  33. 31 G1 - first period grade (numeric: from 0 to 20)
  34. 31 G2 - second period grade (numeric: from 0 to 20)
  35. 32 G3 - final grade (numeric: from 0 to 20, output target)
  36. Additional note: there are several (382) students that belong to both datasets .
  37. These students can be identified by searching for identical attributes
  38. that characterize each student, as shown in the annexed R file.