In trying to understand the big picture of how users learn to program in App Inventor, we want to be able to represent projects in a way suitable for large scale learning analytics. Here I present different representations of projects that could potentially be used to identify App Inventor projects that have structural similarities to each other, e.g., projects created by users following tutorials. I compare the different representations based solely on how accurately they predict the correct tutorial from a labeled data set. The results suggest that we use both blocks and components from a project, apply TF-IDF to the counts of each feature, and measure distance or similarity in terms of a generalized Jaccard distance. This work lays the foundation for being able to find clusters of similar projects to distinguish original from unoriginal projects and to be able to filter out similar projects when trying to determine a user's skill level.