Differences
This shows you the differences between two versions of the page.
resarch:nlpa:paper_1 [2014/09/18 18:10] preethac |
resarch:nlpa:paper_1 [2014/10/02 14:43] (current) preethac |
||
---|---|---|---|
Line 3: | Line 3: | ||
\\ Martin P. Robillard | \\ Martin P. Robillard | ||
- | [[http://www.google.com/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&ved=0CB4QFjAA&url=http%3A%2F%2Fwww.cs.mcgill.ca%2F~martin%2Fpapers%2Ficse2013.pdf&ei=ZVYbVKb_Gs7qoATytYG4Aw&usg=AFQjCNEM0UobXT16enNDJbSsrcAXjiMuiQ&bvm=bv.75097201,bs.1,d.b2U]] | + | [[http://www.google.com/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&ved=0CB4QFjAA&url=http%3A%2F%2Fwww.cs.mcgill.ca%2F~martin%2Fpapers%2Ficse2013.pdf&ei=ZVYbVKb_Gs7qoATytYG4Aw&usg=AFQjCNEM0UobXT16enNDJbSsrcAXjiMuiQ&bvm=bv.75097201,bs.1,d.b2U|Link]] |
+ | |||
+ | |||
+ | **Problem:** | ||
+ | Determining which code elements are salient in informal developer communication, such as forum posts. | ||
+ | |||
+ | |||
+ | **Importance/Applications of the technique:** | ||
+ | \\ Does not require a pre-built index of code elements. | ||
+ | \\ Discovers code elements based on salience. | ||
+ | |||
+ | **Approach:** | ||
+ | \\ 1. Developed a tool, "Automatic Code Element Extractor (ACE)", which performs the following steps: | ||
+ | * Identifies code-like terms in a document using an island parser | ||
+ | * Creates an index of valid code elements | ||
+ | * Reparses the documents to identify ambiguous terms and resolve them | ||
+ | * Outputs the code elements | ||
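The four steps above can be sketched roughly as follows. This is only an illustration under stated assumptions, not ACE's actual implementation: the two regular expressions stand in for an island grammar, and the function names `build_index` and `extract` are hypothetical. The idea shown is that qualified mentions such as `Type.method()` are unambiguous and populate the index, while bare calls such as `method()` are resolved on a second pass only when the index offers exactly one candidate type.

```python
import re
from collections import defaultdict

# Hypothetical "island" patterns (assumptions for illustration): only
# code-like tokens are matched, the surrounding prose is skipped.
QUALIFIED = re.compile(r"\b([A-Z][A-Za-z0-9]*)\.([a-z][A-Za-z0-9]*)\(\)")
BARE_CALL = re.compile(r"(?<![\w.])([a-z][A-Za-z0-9]*)\(\)")


def build_index(documents):
    """Pass 1: collect unambiguous Type.method() mentions into an index."""
    index = defaultdict(set)  # method name -> set of declaring types
    for text in documents:
        for type_name, method in QUALIFIED.findall(text):
            index[method].add(type_name)
    return index


def extract(documents):
    """Pass 2: reparse each document, resolving bare calls via the index."""
    index = build_index(documents)
    results = []
    for text in documents:
        found = {f"{t}.{m}()" for t, m in QUALIFIED.findall(text)}
        for method in BARE_CALL.findall(text):
            owners = index.get(method, set())
            if len(owners) == 1:  # resolve only when exactly one candidate
                found.add(f"{next(iter(owners))}.{method}()")
        results.append(sorted(found))  # output the code elements
    return results
```

In this sketch a bare `add()` in one post is attributed to `ArrayList` if some other post in the collection mentions `ArrayList.add()`, and it is left unresolved if two different types declare `add()`.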
+ | |||
+ | 2. Created a classifier for code element salience | ||
+ | |||
+ | |||
+ | |||
+ | **Evaluation:** | ||
+ | \\ Precision (P) = 0.92 | ||
+ | \\ Recall (R) = 0.90 | ||
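Assuming P and R here denote precision and recall, the small sketch below shows how the two figures are computed from a set of extracted code elements and a set of reference (manually identified) elements. The function name `precision_recall` and the example element names are hypothetical and are not taken from the paper.

```python
def precision_recall(extracted, relevant):
    """Return (precision, recall) for extracted vs. reference elements."""
    extracted, relevant = set(extracted), set(relevant)
    true_positives = len(extracted & relevant)
    # Precision: fraction of extracted elements that are correct.
    precision = true_positives / len(extracted) if extracted else 0.0
    # Recall: fraction of reference elements that were extracted.
    recall = true_positives / len(relevant) if relevant else 0.0
    return precision, recall


# Made-up example: 4 elements extracted, 5 in the reference set, 3 shared.
p, r = precision_recall(
    ["List.add()", "Map.get()", "Set.add()", "Foo.bar()"],
    ["List.add()", "Map.get()", "Set.add()", "Map.put()", "List.size()"],
)
```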
+ | |||
+ | |||
+ | **Critique** | ||
+ | |||
+ | Strengths: | ||
+ | \\ Discards the closed-world assumption (where an index of code elements is created beforehand) | ||
+ | \\ Discards the uniform-importance assumption (previously, each code element would have been considered equally important) | ||
+ | |||
+ | |||
+ | Weaknesses: | ||
+ | \\ Unable to identify indirectly referenced code elements (e.g., answers to questions on Stack Overflow are considered not salient, so indirectly referenced code elements that might be salient are missed) | ||
+ | | ||