scikit-learn provides classes for training GLMs, chosen according to the probability distribution followed by the response variable. A GLM can model a response variable that follows a normal, Poisson, Gamma, Tweedie, or binomial distribution, among others. The Tweedie GLM is a widely used method for predicting insurance premiums.

In R, a Tweedie GLM can be fit with:

    glm(y ~ x, family = tweedie(var.power = 3, link.power = 0))

The tweedie package has several useful functions for working with Tweedie models, including one that generates random samples. For this example, the function tweedie.profile, which finds the MLE of the power of the variance function, did not converge.

I am unsure how to construct a prediction interval properly for a GLM (most probably Gamma) in Python rather than in R. I have found a related package in R, but I do not want to use R to compute the interval.

tweedie is a Python library that implements the Tweedie family as a subclass of scipy's rv_continuous. A minimal example that draws a sample and plots it:

    import tweedie
    import seaborn as sns
    import matplotlib.pyplot as plt

    tvs = tweedie.tweedie(mu=10, p=1.5, phi=20).rvs(100000)
    sns.histplot(tvs)  # sns.distplot is deprecated in recent seaborn releases
    plt.show()

The package's GitHub pages have a fancier example.

The variance power can be estimated with the estimate_tweedie_power method of the statsmodels.genmod.generalized_linear_model.GLM class. I am much less informed about statistics, so I cannot offer more detailed suggestions in response to your comments.

The MLP was selected by cross-validation from a set of candidate models.

H2O also has Generalized Linear Models. They use H2O Frames, though, so you can't pass pandas or NumPy objects directly.