Enhancing machine classifiers with NLP techniques
| ||||||
We have developed a series of machine classifiers to help us analyze, in real-time, the interesting discussion moves made by collaborating students in an e-Discussion. An essential step here is the extraction of features from discussion contributions which are amenable to machine analysis. Given the nature of the task, these features comprise structural aspects of the e-Discussion graphs (e.g., the number of in and out links of nodes in the e-Discussion graph) as well as language characteristics which are currently extracted by means of shallow text processing (e.g., unigrams, bigrams, punctuation). The task in this Masters Thesis project is to explore and test possibilities to improve the classifiers by more advanced language processing techniques (e.g. use of a semantic lexicon like WordNet, Named Entity Recognition, etc). The task would involve: (1) Reading and reviewing the relevant literature (e.g., text and dialogue act classification) (2) Finding appropriate NLP libraries and tools (3) Pre-processing the existing data so that the tools and libraries can be applied to the data (4) Running the experiments with different settings (baseline: the already existing classifiers) (5) Interpreting the results, especially indicating whether the new approaches have been successful (6) Describing possible future work |
Using LSA to analyze e-Discussions
| ||||||
One major problem in the analysis of natural language texts is the existence of an almost unlimited number of ways to express the same message. Latent Semantic Analysis (LSA) is one approach to remedy this problem by mapping single words and texts to a semantic space in which texts with similar content appear near to each other, even if both texts do not share a single common word. Prior work in the area of dialogue act classification harnessed LSA to classify new dialogue moves using a nearest neighbor approach. Other usages of LSA include the identification of correct and incorrect aspects of student answers in human-machine dialogues, and the automatized grading of student essays. The task in this Master Thesis project is to apply the same principles to the area of our interest, namely the automated analysis of e-Discussions. Can we find occurances of prototypical arguments? Can we determine whether crucial arguments, which we would expect for a given topic, are missing? Can we assess the overall quality of a discussion? The task is to research these and related questions. The task would involve: (1) Reading and reviewing the relevant literature (LSA basics and applications) (2) Finding / Implementing appropriate tools and libraries to build a LSA space and to run controlled experiments with varying conditions (different background corpora, different LSA parameters) (3) Finding / Collecting appropriate text corpora (general and topic-specific) (4) Pre-processing the existing data so that the tools and libraries can be applied to the data (5) Running the experiments with different settings (6) Interpreting the results, especially indicating whether the approaches have been successful (7) Describing possible future work |
Learning by knowing different solution paths
| ||||||
In ActiveMath students can exercise interactively and study worked examples. In some cases it is impossible to determine the solution path a student has chosen just by looking at the (correct) result. Therefore, the attribution to the correct competency is hard to determine. In order to overcome this limitation, and further extend students' ability to see and use different solution path to solve the same problem, students' can be asked to specify their solution path from several possibilities provided. Additionally they can be requested to solve the same problem using an alternative solution path. The task: (1) design/select exercises that allow for the multiple solutions, (2) extend the exercise and worked example representation so that it allows for alternative solutions, (3) develop an extension of ActiveMath that provides students with alternative solutions of worked examples and exercises upon request as well as unsolicited and possibly asks the student to compare the solutions, (4) test the extension of ActiveMath with students, (5) use the resulting log files and Machine Learning techniques to evaluate the benefit of the extension. Possible subsequent research: (1) Does this approach lead to better perfomance? (2) Does the confrontation with additional solution paths increase knowledge transfer to similar problems? |
Peer Authoring
| ||||||
The goal of this master thesis is to develop a simplified authoring tool that can be easily used by students to: (1) vary a given interactive exercise or exercise template, (2) author an exercise, a text, add links for peer students, (3) add or author explanations to a worked solution of an exercise, (4) add or author feedback in exercises, (5) add useful visualizations and facilities to put the new content into a content collection so that ActiveMath can present it to a group of peers. Each student from the group should be alerted about contribution of peers. Web 2.0 techniques can be used as well as existing components of ActiveMath including the authoring tool Extasy. |
interactive Concept Mapping
| ||||||
Concept mapping is a well known cognitive technique for visualising and constructing relationships between different concepts. It bases on the assumption that the human brain consists of a (hierarchically) structured network of concepts and that concept maps are a visual externalisation of this structure. Hence, concept mapping is known for stimulating analytical and reflective thinking. ActiveMath provides an interactive concept mapping tool called iCMap. In the past two years, we have evaluated it and acquired a lot of new insights which contributed to the decision to reimplement iCMap. Hence, we have some good starting point literature, however, this task is still ambitious. A perfect candidate for this task is a computer scientist with either psychology or teaching education as minor subject. We plan to make evaluations towards the end of the thesis. There are a lot of questions to investigate, be it on usability or on learning effects. The technical requierements are: we want a rich client Java application. Its architecture should be modular as much as possible to be able to extend it or to replace components by new ones. It should include a rule-based engine which simplifies authoring of inference rules for checking consistency, transitivity, etc. The rules should be stated in a declarative way. One possibility could be to use JBoss Rules, another to use some AI techniques. We'd like to add a more intuitive suggester mechanism and a more suitable layout algorithm (Piccolo framework could be nice). The tool should be sexy, so if you like to play with Java 2D, this is your playground. |
Competency Matching Framework
| ||||||
Example scenario: Assume, you are searching for a job or for a course. Both require prior knowledge which can be expressed by competencies. In a simple case, you would do a match between the competencies you have and the required competencies. In a more complex case, you want to know your gaps. The gaps ususally cannot be closed just by one course. Instead, you will have to go through an education chain of courses to close the gap. In this master thesis you will develop a competency matching framework including a semantic competency model, a simple exact matching algorithm, a matching algorithm that involves different types of competency similarieries, competency substitutions, competency subsumptions, and a competency measure. The information about courses and jobs will be provided via a database. The information about an individual's competencies stems from example e-portfolios. Integration of an open-source spread-sheet system with the web-based learning environment. This can be realized as client-based tool or as web-service. In addition, the tool has to be extended such that it can provide feedback and help to the learner. |
Data mining for supporting the diagnosis of solution-strategies
| ||||||
Based on a training set of textual data of learning diaries (and potentially on certain slot descriptions) annotated with human diagnosis of solution strategies the task is to apply/invent text mining and/or data mining techniques for classifying a new learner's strategy. |
Neue adaptive Szenarien für Lernumgebung ActiveMath
| ||||||
Zu entwickeln ist ein Hausaufgabenszenario für das intelligente Lernsystem ActiveMath. Bei diesem werden nach mehreren Kriterien adaptiv Aufgaben für Hausaufgaben in Mathematik zusammengestellt. Dieses soll nach im SFB-Schwerpunkt 'Bildungsqualität Schule' erarbeiteten pädagogischen Vorstellungen modelliert werden. Kontakte und Literatur sind gegeben: Mathematik Lehren 140, 2007, Thema Hausaufgaben. Zu erweitern ist das Lernszenario 'Prüfungsvorbereitung' so dass es nicht nur zu bestimmten Themen/Konzepten adaptiert wird, sondern auch bestimmte Typen von Aufgaben/Kompetenzen vorbereitet sowie auf eine Menge von potentiellen Prüfungsfragen/Aufgaben zugeschnitten werden kann. Dieses Szenario soll in einer Klasse getestet werden gegen das derzeitige nur auf Konzepte adaptierbare Prüfungsszenario. |
Assembly of Learning Items
| ||||||
ActiveMath provides means for authors to assembly books based on learning items stored in ActiveMath's repository. Another way to create a course is to ask the course generator to build a tailored book according to the learner's goals and current knowledge. However, both ways do not provide the facility for students to create their own books and to work with content and its structure actively. This task is solved by ActiveMath's assembly tool. However, this tool has much more potential than the mere assembly of ActiveMath's learning items. And this is your task: enhance the assembly tool! Provide sharing of books, integration of external objects, and means for collaboration. Distinguish between an assembly tool for students and one for teachers. |
Gender Related Adaptation
| ||||||
|
Integration of a spread-sheet system into ActiveMath
| ||||||
Integration of an open-source spread-sheet system with the web-based learning environment. This can be realized as client-based tool or as web-service. In addition, the tool has to be extended such that it can provide feedback and help to the learner. |
AJAX-powered mathematical text-input
| ||||||
The objective is to realize a non-obstrusive text-input tool for semantic mathematical formulæ producing XML output in OpenMath or content-mathml with the following characteristics: web-based, using a simple text-field sturdy to parse incompletely input formulae presenting asymchronously a 2d-rendering of the formula (using, e.g., ActiveMath's presentation from OpenMath) able to report errors and bring user to error location in input able to let user select a location or fragment in the rendering triggering a selection in the text-field, and vice-versa as much as browsers provide access to able to receive drop or paste of XML or URL-to-it and convert it to the input syntax extensible to new input patterns This Bachelor work is stimulated by our experimental results which indicates that the easy access of plain-text-input goes beyond the confort of use of input-palettes. We expect this work to yield an open-source product licensed with a non-viral license (for example under the Apache Public License) and expect demand at cooperating researchers. |
Production and treatment of erroneous math-examples
| ||||||
|
Exercise Generation with verbalization
| ||||||
The generation of exercises in natural language will use templates for the mathematical content and enhance these by ontologies of several application domains (so that mathematically same exercises can be stated, e.g., for medical situations, technical situations, demographic situations etc). This is the basis for NL macro-planning and will be combined with a NL-micro planner that returns the actual natural language formulation of the exercise. |
Exercise evaluators and adaptive feedback
| ||||||
|
Modelling Diagnose and Misconceptions in Fractional Arithmetic
| ||||||
|
Agent-based evaluation and feedback mechanism
| ||||||
|
Emotions in ActiveMath
| ||||||
|
Diagnosis-system for fraction computations
| ||||||
|
Added Value Mathematical Formulae Presentation
| ||||||
ActiveMath presents mathematical content to a browser from a semantically encoded content. Among others, this allows the mathematical formulae to be presented with tooltips, click-to-explain-the-symbol, or copy-and-paste, all from the simple fact of enconding semantically the formulae (using OQMath, OMDoc, and OpenMath). We wish to extend this ``value-added presentation of formulae'' by considering supplementary input of the author. An example would be annotations about the variables in a well-defined context (e.g. ``m is a the mass of the ball''), another avenue might be the usage of type-information about the symbols. The bachelor work should include the enrichment of a 10-15 pages ``book'' in order to convince mathematicians of the well-foundedness of the approach. |
Semantic conversion of presentation mathematical documents
| ||||||
The goal is to use existing tools such as WebEQ, HELM's and many others to provide the automated conversion of presentation-encoded mathematical documents, typically easily convertible to MathML- presentation or TeX, to the semantic format of OMDoc, including OpenMath for mathematical expressions. The breadth and generalizability of this conversion defines the grade: In the case of a master's thesis, it would be expected that most of the wikipedia content is convertible. Alternatively, thorough coverage of thematic segments of the abstracts of Zentralblatt Math should be convertible (the latter being the biggest and oldest mathematical research litterature review service). n the case of a bachelor thesis, a subset of this is expected, for example a thematic part of wikipedia. |