Differences
This shows you the differences between two versions of the page.
|
resarch:nlpa:paper_2 [2014/09/18 18:22] preethac created |
resarch:nlpa:paper_2 [2014/09/25 16:10] (current) preethac |
||
|---|---|---|---|
| Line 4: | Line 4: | ||
| [[http://serl.clarkson.edu/site/wp-content/uploads/2013/04/icpc2013.pdf|Link]] | [[http://serl.clarkson.edu/site/wp-content/uploads/2013/04/icpc2013.pdf|Link]] | ||
| + | |||
| + | **Problem:** identify problematic API design features automatically | ||
| + | |||
| + | **Importance/Applications of the technique:** | ||
| + | |||
| + | Enable speed reading of forums for problems | ||
| + | Enable queries for search for problematic features | ||
| + | Estimating API hot topics | ||
| + | |||
| + | **Approach:** | ||
| + | |||
| + | assumption is that hot topics in forums about a given API that are discussed frequently are | ||
| + | |||
| + | **problematic API features** | ||
| + | |||
| + | assumption is that problematic API features are discussed in negative sentences and its neighbors | ||
| + | approach was to identify negative sentences, then extract features from negative sentence +2 and -1 sentences | ||
| + | negative sentences are identified by sentiment140 to categorize sentences as negative, positive, neutral | ||
| + | then use Stanford NLP tools to extract word phrases with API dictionary words in it | ||
| + | create dictionary from SWING tutorial to get the specific API features - closed world assumption | ||
| + | Developed a tool called Haystack | ||
| + | |||
| + | **Evaluation:** | ||
| + | |||
| + | measured precision against gold set from Swing api | ||
| + | showed high precision | ||
| + | did not measure recall | ||
| + | ** | ||
| + | Critique** | ||
| + | |||
| + | Strengths: | ||
| + | important problem of extracting information about developer's issues with API usage | ||
| + | use of sentiment analysis techniques that exist for identifying negative sentences | ||
| + | interesting approach and categorization of sentences into several categories - maybe useful for us to automate | ||
| + | |||
| + | Weaknesses: | ||
| + | first author created gold set and second author confirmed it | ||
| + | precision was measured with an inappropriate match between gold set and extracted feature words in sentence sets | ||
| + | |||
| + | thus the high precision numbers reported are not really true | ||
| + | |||
| + | the premise and title that these kinds of things they are extracting are problematic API features is not really accurate and should be relabeled as things such as how-tos, problems, etc | ||
| + | |||