Statistical and Computational Approaches for Data Integration and Constrained Variable Selection in Large Datasets