State University of New York at Albany 2013 - 2015
Bachelor of Science, Computer Science - May 2015
Masters Level Courses
Review of discrete mathematics and probability used in algorithm design and analysis. Analyses of divide-and-conquer algorithms, number-theoretical algorithms, and algorithms for sorting, searching, and order statistics. Basic data structures. Basic graph algorithms for topological sorting, constructing minimum cost spanning trees, and shortest path problems.
- Advanced Programming Concepts
The main focus of the course is on declarative programming and axiomatic semantics. In declarative programming we cover the Functional Programming (FP) and the Logic Programming (LP) paradigms. The emphasis in functional programming will be on programming with higher-order functions, and polymorphism as in the programing language ML. In Logic Programming we cover unification, SLD-resolution and backtracking, as in Prolog. Also covered are Floyd-Hoare axiomatic semantics, and verification of programs.
- Database Systems I
Basic concepts of a database system, with emphasis on database schema design, data manipulation languages, and query languages. Data models. Relational databases, functional dependencies, normal forms. Relational calculus, relational algebra, SEQUEL, Query-by-Example. The CODASYL network model: data definition and data manipulation languages.
Thelma Miller Award
Receiver of the Undergraduate Research Endowed Fellowship, awarded by the Ken W. and Thelma Miller Abele Endowed Fund. This is awarded to the top few undergraduate researchers determined by the research committee.
Hudson Valley Community College 2009 - 2011
Acquired 68 credit hours in an attempt to transfer into a Computer Science program at a 4 year institution.
We The People Petitioning Platform
Since September 2011, the Whitehouse has been accepting and responding to various petitions introduced by citizens across the world in an attempt to give Democracy a place on the internet. This introduced many interesting research questions that our group attempted to answer. Including monitoring co-signing behavior and graphing social networks, finding communities within signers using Community Detection, predicting success through Natural Language Processing, and trying to determine effectiveness of this petitioning platform.
i2b2 Medical Record Deidentification
Our group dealt with unformatted medical records, particularly trying to ascertain which data is private health information and which isn't. The purpose of this is to achieve a high score in deidentifying documents using Natural Language Processing.
- Dumas, C., LaManna, D., Harrison, T.M., Ravi, S.S., Kotfila, C., Gervais, N., Hagen, L., & Chen,F. 2014. Examining political mobilization of online communities through e-petitioning behavior in We the People. Under review for Big Data and Society Journal. (2014).
- Dumas, C., LaManna, D. Harrison, T.M., Ravi, S.S., Kotfila, C., Gervais, N., Hagen, L., and Chen, F. 2014. Examining Political Mobilization of Online Communities through E-Petitioning Behavior in We The People (extended abstract) Presented at the Social Media & Society Conference, July 27-29, 2014,Toronto, Canada,.
- Dumas, C., LaManna, D., Harrison, T. M., Ravi, S.S., Kotfila, C., and Chen F. (2015). E-petitioning as collective political action in We the People. Accepted to the iConference, March 24-27, Newport Beach, CA.
- Hagen, L. et al. under review. Introducing Semantic Tools for Policy Analysis: A Case Study of E-petitions. (under review).
- Hagen, L. Uzuner, O., Kotfila, C., Harrison, T.M., and LaManna, D. 2015. Understanding Citizens’ Direct Policy Suggestions to the Federal Government: A Natural Language Processing and Topic Modeling Approach. 2015 48th Hawaii International Conference on System Sciences (HICSS) (Kauai, HI, 2015).
- Harrison, T.M. Dumas, C, Kotfila, C., LaManna, d., Ravi, S.S. 2014. We the People: U.S. E-Petitioning as Technology-Mediated Social Action. Presented at the 64th Annual Conference of the International Communication Association (ICA) Seattle, WA.
Research Programmer, SUNY Research Foundation June 2013 - January 2015
- Performed Natural Language Processing on unformatted medical records.
- Created and documented toolset for use with unformatted medical records: https://github.com/danlamanna/i2b2tools
- Used tools which implemented machine learning methods such as ensemble and SVM’s to combine known private health information deidentifiers.
- Performed various analyses on datasets (market basket, community detection, etc)
- Managed small team of researchers in executing technical tasks to increasing our research data store.
- Technologies involved were Python, Java, Git, MySQL, and IPython notebook to ensure research reproducibility.
Lead Developer, Burst Marketing January 2011 - March 2014
- Managed selection of tools and technologies on a per project basis.
- Created custom WordPress plugins, Magento modules, and CodeIgniter libraries to spec per stakeholders’ expectations.
- Exposed to various design patterns, MVC, Listener/Observer, Singleton, etc.
- Designed internal tools to handle large volume of PHP codebases and managing continuous updates via Git and SVN.
- Implemented sandboxed environments across distributed development teams using Vagrant, Puppet, and Python.
- Administered small cluster of servers to ensure uptime and proper configurations throughout.
Open Source Project History
For a more complete list, see my github account.
Magit Contribution :emacs:lisp:
I contributed to magits git-svn integration in order to speed up my development for projects still on Subversion while at Burst Marketing.
Advanced Post Privacy :php:wordpress:
I created a basic but flexible WordPress plugin to fill a gap in functionality I felt was missing from core.
This ended up being useful on multiple future projects I had worked on.
I2b2 Toolset :python:nlp:i2b2:
I created a set of tools as a sort of capstone to my research with natural language processing. It's designed for use with unformatted medical records, of i2b2 XML format.
Django - The Python Web Framework
Open Source Festival (SUNY Albany) 2013
In web development a number of tasks are performed over and over from application to application. These tasks require a lot of groundwork that shifts developers' focus away from the business logic of their applications. These tasks can be abstracted, simplified, and automated to the point where they can be quickly performed allowing developers to get to the heart of the application. That is the goal of modern web frameworks.
Django is such a web framework written in Python. It encourages rapid development of web applications and began as a framework used largely in the news industry. It is entirely open source and community driven.
I will primarily discuss how Django works behind the scenes, such as using the Model View Template software pattern to seperate business logic from presentation code, Object Relational Mapping to simplify database interaction, as well as several pre-built user interfaces that have been integrated to provide a better experience for developers and clients. Altogether this should present a clear view as to how Django can encourage rapid application development, as well as provide value to its users. This presentation will include a demonstration of a simple web application written in Django.