REFERENCES
[1] R. Agrawal and R. Srikant. Fast algorithms for mining association rules
in large databases. In Proceedings of the International Conference on
Very Large Data Bases, pages 487–499, 1994.
[2] M. Aiello. The Web Was Done by Amateurs: A Reflection on One of
the Largest Collective Systems Ever Engineered. Springer, 2018.
[3] S. S. Alqahtani, E. E. Eghan, and J. Rilling. Recovering semantic trace-
ability links between APIs and security vulnerabilities: An ontological
modeling approach. In Proceedings of the International Conference on
Software Testing, Verification and Validation, 2017.
[4] M. Aniche, G. Bavota, C. Treude, M. A. Gerosa, and A. Deursen.
Code smells for model-view-controller architectures. Empirical Software
Engineering, 23(4):2121–2157, 2018.
[5] M. Aniche, C. Treude, I. Steinmacher, I. Wiese, G. Pinto, M.-A. Storey,
and M. A. Gerosa. How modern news aggregators help development
communities shape and share knowledge. In Proceedings of the Inter-
national Conference on Software Engineering, pages 499–510, 2018.
[6] G. Antoniol, B. Caprile, A. Potrich, and P. Tonella. Design-code
traceability for object-oriented systems. Annals of Software Engineering,
9(1-4):35–58, 2000.
[7] S. Baltes and S. Diehl. Usage and attribution of stack overflow code
snippets in github projects. Empirical Software Engineering, pages 1–
37, 2018.
[8] S. Baltes, L. Dumani, C. Treude, and S. Diehl. Sotorrent: Reconstructing
and analyzing the evolution of stack overflow posts. In Proceedings of
the International Conference on Mining Software Repositories, pages
319–330, 2018.
[9] J. M. Barrie and D. E. Presti. Digital plagiarism-the web giveth and the
web shall taketh. Journal of medical Internet research, 2(1), 2000.
[10] F. Beuke. Github Language Statistics – GitHut 2.0. https://madnight.
github.io/githut/. [Online; accessed Aug 2018].
[11] F. Chen and S. Kim. Crowd debugging. In Proceedings of the Joint
Meeting on the Foundations of Software Engineering, 2015.
[12] D. M. German, Y. Manabe, and K. Inoue. A sentence-matching method
for automatic license identification of source code files. In Proceedings
of the International Conference on Automated Software Engineering,
pages 437–446, 2010.
[13] GitHub. The State of the Octoverse 2017. https://octoverse.github.com/,
2017. [Online; accessed Aug 2018].
[14] M. Goldman and R. C. Miller. Codetrail: Connecting source code and
web resources. Journal of Visual Languages & Computing, 20(4):223–
235, 2009.
[15] C. G
´
omez, B. Cleary, and L. Singer. A study of innovation diffusion
through link sharing on stack overflow. In Proceedings of the Working
Conference on Mining Software Repositories, pages 81–84, 2013.
[16] G. Gousios. The ghtorent dataset and tool suite. In Proceedings of the
Working Conference on Mining Software Repositories, pages 233–236,
2013.
[17] M. Klein, H. Van de Sompel, R. Sanderson, H. Shankar, L. Balakireva,
K. Zhou, and R. Tobin. Scholarly context not found: one in five articles
suffers from reference rot. PloS one, 9(12):e115253, 2014.
[18] W. Koehler. Web page change and persistence—a four-year longitudinal
study. Journal of the American Society for Information Science and
Technology, 53(2):162–171, 2002.
[19] R. V. Krejcie and D. W. Morgan. Determining sample size for research
activities. Educational and psychological measurement, 30(3):607–610,
1970.
[20] A. La. Language Trends on GitHub – The GitHub Blog. https:
//blog.github.com/2015-08-19-language-trends-on-github/, 2015. [On-
line; accessed Aug 2018].
[21] E. D. S. Maldonado, R. Abdalkareem, E. Shihab, and A. Serebrenik.
An empirical study on the removal of self-admitted technical debt. In
Proceedings of the International Conference on Software Maintenance
and Evolution, pages 238–248, 2017.
[22] J. Markwell and D. W. Brooks. Broken links: The ephemeral nature
of educational www hyperlinks. Journal of Science Education and
Technology, 11(2):105–108, 2002.
[23] N. Munaiah, S. Kroh, C. Cabrey, and M. Nagappan. Curating github
for engineered software projects. Empirical Software Engineering,
22(6):3219–3253, 2017.
[24] J. Murphy, N. H. Hashim, and P. OConnor. Take me back: validating
the wayback machine. Journal of Computer-Mediated Communication,
13(1):60–75, 2007.
[25] T. H. Nelson. Xanalogical structure, needed now more than ever: parallel
documents, deep links to content, deep versioning, and deep re-use. ACM
Computing Surveys, 31(4es):33, 1999.
[26] S. Onoue, H. Hata, R. G. Kula, and K. Matsumoto. Human capital in
software engineering: A systematic mapping of reconceptualized human
aspect studies. CoRR, abs/1805.03844, 2018.
[27] L. Pascarella and A. Bacchelli. Classifying code comments in java open-
source software systems. In Proceedings of the International Conference
on Mining Software Repositories, pages 227–237, 2017.
[28] A. Potdar and E. Shihab. An exploratory study on self-admitted technical
debt. In Proceedings of the International Conference on Software
Maintenance and Evolution, pages 91–100, 2014.
[29] S. Radevski, H. Hata, and K. Matsumoto. Towards building api usage
example metrics. In Proceedings of the International Conference on
Software Analysis, Evolution, and Reengineering, pages 619–623, 2016.
[30] M. Rahimi and J. Cleland-Huang. Evolving software trace links between
requirements and source code. Empirical Software Engineering, 2017.
[31] M. Rath, J. Rendall, J. L. C. Guo, J. Cleland-Huang, and P. Mder.
Traceability in the wild: Automatically augmenting incomplete trace
links. In Proceedings of the International Conference on Software
Engineering, 2018.
[32] I. K. Ratol and M. P. Robillard. Detecting fragile comments. In
Proceedings of the International Conference on Automated Software
Engineering, pages 112–122, 2017.
[33] M. P. Robillard, A. Marcus, C. Treude, G. Bavota, O. Chaparro, N. Ernst,
M. A. Gerosa, M. Godfrey, M. Lanza, M. Linares-Vsquez, G. C.
Murphy, L. Moreno, D. Shepherd, and E. Wong. On-demand developer
documentation. In Proceedings of the International Conference on
Software Maintenance and Evolution, pages 479–483, 2017.
[34] G. Scanniello, C. Gravino, M. Genero, J. A. Cruz-Lemus, G. Tortora,
M. Risi, and G. Dodero. Do software models based on the UML aid in
source-code comprehensibility? aggregating evidence from 12 controlled
experiments. Empirical Software Engineering, 2018.
[35] J. Sillito, F. Maurer, S. M. Nasehi, and C. Burns. What makes a good
code example?: A study of programming q&a in stackoverflow. In
Proceedings of the International Conference on Software Maintenance,
pages 25–34, 2012.
[36] M.-A. Storey, J. Ryall, R. I. Bull, D. Myers, and J. Singer. Todo or
to bug. In Proceedings of the International Conference on Software
Engineering, pages 251–260, 2008.
[37] L. Tan, D. Yuan, G. Krishna, and Y. Zhou. /*icomment: Bugs or bad
comments?*/. In Proceedings of the Symposium on Operating Systems
Principles, pages 145–158, 2007.
[38] C. Treude, O. Barzilay, and M.-A. Storey. How do programmers ask
and answer questions on the web? (nier track). In Proceedings of
the International Conference on Software Engineering, pages 804–807,
2011.
[39] A. J. Viera, J. M. Garrett, et al. Understanding interobserver agreement:
the kappa statistic. Fam Med, 37(5):360–363, 2005.
[40] C. Wohlin, D.
ˇ
Smite, and N. B. Moe. A general theory of software en-
gineering: Balancing human, social and organizational capitals. Journal
of Systems and Software, 109:229–242, 2015.
[41] E. Wong, J. Yang, and L. Tan. Autocomment: Mining question and
answer sites for automatic comment generation. In Proceedings of the
International Conference on Automated Software Engineering, pages
562–567, 2013.
[42] Y. Wu, Y. Manabe, T. Kanda, D. M. German, and K. Inoue. Analysis
of license inconsistency in large collections of open source projects.
Empirical Software Engineering, 22(3):1194–1222, 2016.
[43] X. Xia, L. Bao, D. Lo, P. S. Kochhar, A. E. Hassan, and Z. Xing. What
do developers search for on the web? Empirical Software Engineering,
22(6):3149–3185, 2017.
[44] M. Zahedi and M. A. Babar. Why does site visit matter in global
software development: A knowledge-based perspective. Information and
Software Technology, 80:36–56, 2016.
[45] F. Zampetti, A. Serebrenik, and M. Di Penta. Was self-admitted technical
debt removal a real removal?: An in-depth perspective. In Proceedings
of the International Conference on Mining Software Repositories, pages
526–536, 2018.