Differences

resarch:nlpa:paper_2 [2014/09/18 18:22]
preethac created
resarch:nlpa:paper_2 [2014/09/25 16:10] (current)
preethac
Line 4: Line 4:
  
 [[http://serl.clarkson.edu/site/wp-content/uploads/2013/04/icpc2013.pdf|Link]]
 +
 +**Problem:** automatically identify problematic API design features
 +
 +**Importance/Applications of the technique:**
 +
 +    Enable speed-reading of forums to spot problems
 +    Enable search queries for problematic features
 +    Estimate API hot topics
 +
 +**Approach:**
 +
 +    assumption: topics about a given API that are discussed frequently in forums are **problematic API features**
 +    assumption: problematic API features are discussed in negative sentences and their neighbors
 +    the approach identifies negative sentences, then extracts features from each negative sentence and its neighboring sentences (+2 and -1)
 +    Sentiment140 categorizes each sentence as negative, positive, or neutral; the negative ones are kept
 +    Stanford NLP tools then extract word phrases that contain API dictionary words
 +    the dictionary is built from the Swing tutorial to get the specific API features (closed-world assumption)
 +    the authors developed a tool called Haystack
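
The steps above can be sketched roughly as follows. This is a minimal illustration, not the Haystack implementation: `toy_sentiment` is a hypothetical keyword stand-in for Sentiment140, plain token matching stands in for the Stanford NLP phrase extraction, the six-term dictionary is made up, and reading "+2 and -1" as "one sentence before and two after" is an assumption.

```python
import re
from typing import Callable, List, Set

# Hypothetical cue words; the paper uses Sentiment140, not a keyword list.
NEGATIVE_CUES = {"not", "cannot", "fails", "broken", "problem", "wrong"}


def tokenize(sentence: str) -> Set[str]:
    """Lowercase the sentence and return its set of alphabetic tokens."""
    return set(re.findall(r"[a-z']+", sentence.lower()))


def toy_sentiment(sentence: str) -> str:
    """Placeholder classifier: 'negative' if a cue word occurs, else 'neutral'."""
    return "negative" if tokenize(sentence) & NEGATIVE_CUES else "neutral"


def context_window(sentences: List[str], i: int,
                   before: int = 1, after: int = 2) -> List[str]:
    """Return sentence i plus `before` preceding and `after` following sentences."""
    start = max(0, i - before)
    end = min(len(sentences), i + after + 1)
    return sentences[start:end]


def extract_features(sentences: List[str], api_terms: Set[str],
                     classify: Callable[[str], str] = toy_sentiment) -> Set[str]:
    """Collect dictionary terms found in or near negative sentences."""
    found: Set[str] = set()
    for i, sentence in enumerate(sentences):
        if classify(sentence) != "negative":
            continue
        for ctx in context_window(sentences, i):
            tokens = tokenize(ctx)
            # Closed-world assumption: only dictionary terms can be features.
            found |= {term for term in api_terms if term.lower() in tokens}
    return found


# Made-up forum post and dictionary, for illustration only.
sentences = [
    "I am using a JFrame.",
    "I added a JButton to a JPanel.",
    "The layout is broken after resizing.",
    "I tried GridBagLayout instead.",
    "Setting the size on the JScrollPane helped.",
    "Thanks, JTable works fine.",
]
api_terms = {"JFrame", "JButton", "JPanel", "GridBagLayout", "JScrollPane", "JTable"}

features = extract_features(sentences, api_terms)
print(sorted(features))
```

Only the third sentence is classified as negative here, so terms outside its window (`JFrame`, `JTable`) are not reported.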
 +
 +**Evaluation:**
 +
 +    measured precision against a gold set built from the Swing API
 +    reported high precision
 +    did not measure recall
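
As a reminder of what the evaluation does and does not cover, here is a small sketch of set-based precision and recall. The extracted and gold sets below are invented for illustration and are not data from the paper; exact string matching between the two sets is also an assumption.

```python
from typing import Set


def precision(extracted: Set[str], gold: Set[str]) -> float:
    """Fraction of extracted features that appear in the gold set."""
    if not extracted:
        return 0.0
    return len(extracted & gold) / len(extracted)


def recall(extracted: Set[str], gold: Set[str]) -> float:
    """Fraction of gold-set features that were extracted."""
    if not gold:
        return 0.0
    return len(extracted & gold) / len(gold)


# Invented example sets, not results from the paper.
extracted = {"JButton", "JPanel", "GridBagLayout", "JScrollPane"}
gold = {"JPanel", "GridBagLayout", "JScrollPane", "JTable", "JFrame", "JList"}

p = precision(extracted, gold)
r = recall(extracted, gold)
print(f"precision={p:.2f} recall={r:.2f}")
```

A high precision with unmeasured recall says nothing about how many problematic features were missed, which is why the missing recall figure matters.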
 +**Critique**
 +
 +    Strengths:
 +        addresses an important problem: extracting information about developers' issues with API usage
 +        reuses existing sentiment analysis techniques to identify negative sentences
 +        interesting approach and categorization of sentences into several categories; automating this may be useful for us
 +
 +    Weaknesses:
 +        the first author created the gold set and the second author confirmed it
 +        precision was measured with an inappropriate match between the gold set and the feature words extracted from the sentence sets, so the high precision numbers reported are not really trustworthy
 +        the premise (and title) that the extracted items are problematic API features is not really accurate; they should be relabeled as things such as how-tos, problems, etc.
 +
 +
resarch/nlpa/paper_2.1411078923.txt.gz · Last modified: 2014/09/18 18:22 by preethac