Hi, I'm Colm Lang:

Software Engineer

and Computer Science Student @ USF

I'm a results-driven Software Engineer who excels in fast-paced environments, consistently delivering an exceptional rate of development to the teams and projects I work on. My passion centers around projects that harness big data, distributed systems, and data visualization to solve problems with real human impact. Excited to contribute my skills and experience to projects that push boundaries and drive positive change.

Colm Lang

Category

Education


University of San Francisco

Computer Science

2020 - 2024

Relevant Courses


Advanced Network Security (MS), Data Structures and Algorithms (MS), Software Development (MS), Big Data (MS), Computer Architecture, Operating Systems, C & Systems Programming, Systems Security

Category

Skills


GolangJavaCJavaScriptTypeScriptRust

Category

Projects


Project Image

DevOps

Golang Github Action Grading Pipeline

A Github Action that grades student submissions automatically and applies necessary deductions for late submissions. Provides a detailed report of the grading process in the Github Summary markdown file for the action.

TypeScriptGolangGithub ActionsGithub APIMySQL
Project Image

Distributed Systems

Distributed File System & Computation Engine

CDFS (Colm's Distributed File System) is a distributed file system and computation engine that allows users to store and process data in a distributed manner. The system is fault-tolerant and supports replication and sharding of data. The system is also scalable and supports the addition of new nodes to the cluster. Modeled after the papers on HDFS and MapReduce, this system performs user-submitted Golang plugins on the fly and stores the output back into the dfs.

GolangDistributed SystemsBig DataMapReduceHDFS
Project Image

Big Data

Spatiotemporal Climate Analysis

Extracted valuable insights into spatiotemporal climate patterns through in-depth analysis of terabytes of climate data using Apache Spark and Matplotlib for Data-Visualization. Developed efficient Spark jobs to filter, aggregate, and process extensive climate data, optimizing parallel processing for enhanced performance and minimizing computation time on a cluster of 12 nodes.

Apache SparkPythonBig DataData Visualization
Project Image

Full-Stack

Go Get It: Full-Stack Search Engine & Web Crawler

Go Get It is a full-stack search engine and web crawler that allows users to search for web pages and crawl the web for new pages. The system is built using Java and uses a custom-built web crawler to crawl the web. The system also uses a custom-built search engine that uses TF-IDF to rank web pages.

JavaJetty ServletsWeb Crawling