We consider the -armed stochastic contextual bandit problem with dimensional features, when both and can be large. To the best of our knowledge, all existing algorithm for this problem have a regret bound that scale as polynomials of degree at least two in and . The main contribution of this paper is to introduce and theoretically analyze a new algorithm (REAL-bandit) with a regret that scales by when is rank of the matrix of unknown parameters. REAL-bandit relies on ideas from low-rank matrix estimation literature and a new row-enhancement subroutine that yields sharper bounds for estimating each row of the parameter matrix that may be of independent interest.