Induced pluripotent stem cells (iPSCs) and human embryonic stem cells (hESCs) differentiated into hepatocyte-like cells (HLCs) provide a defined and renewable source of cells for drug screening, toxicology and regenerative medicine. We previously reprogrammed human fetal foreskin fibroblast cells (HFF1) into iPSCs employing an episomal plasmid-based integration-free approach, this iPSC-line and the hESC lines H1 and H9 were used to model hepatogenesis in vitro. Biochemical characterisation confirmed glycogen storage, ICG uptake and release, urea and bile acid production, as well as CYP3A4 activity. Microarray-based transcriptome analyses was carried out using RNA isolated from the undifferentiated pluripotent stem cells and subsequent differentiation stages- definitive endoderm (DE) hepatic endoderm (HE) and HLCs. K-means identified 100 distinct clusters, for example, POU5F1/OCT4 marking the undifferentiated stage, SOX17 the DE stage, HNF4a the HE stage, and ALB specific to HLCs, fetal liver and primary human hepatocytes (PHH). This data descriptor describes these datasets which should be useful for gaining new insights into the molecular basis of hepatogenesis and associated gene regulatory networks.