a.1 Assuming you have no statistics and know nothing about the contents of each relation, which plan is likely to cost less? Why?

a.2 Assume now that you have the following statistics:
• LibraryBooks contains 10,000,000 tuples
• There are 10 different values for the Branch attribute of LibraryBooks
• There are 100,000 Patrons tuples
• There are 100,000 CheckedOut tuples
• Only 5% of checked-out books are overdue on average
Is the same plan you chose in part a.1 likely to be the cheapest? Why or why not?

a.3 Is a real cost-based optimizer likely to choose one of these two plans? Why or why not?
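
One way to start on part a.2 is to turn the statistics into rough cardinality estimates before comparing plans. The sketch below is only an illustration under stated assumptions: it assumes Branch values are uniformly distributed, that the query filters on a single Branch value, and that each CheckedOut tuple corresponds to one checked-out book. The function and variable names are hypothetical and do not come from the question.

```python
# Statistics quoted in part a.2.
LIBRARY_BOOKS_TUPLES = 10_000_000
BRANCH_DISTINCT_VALUES = 10
PATRONS_TUPLES = 100_000
CHECKED_OUT_TUPLES = 100_000
OVERDUE_PERCENT = 5


def equality_selection_estimate(tuples: int, distinct_values: int) -> int:
    """Estimate tuples matching `attr = constant`.

    Assumption: values are uniformly distributed, so each distinct
    value accounts for an equal share of the relation.
    """
    return tuples // distinct_values


def percent_selection_estimate(tuples: int, percent: int) -> int:
    """Estimate tuples passing a filter that keeps `percent` percent of rows."""
    return tuples * percent // 100


# Assumption: the plan filters LibraryBooks on one Branch value.
books_per_branch = equality_selection_estimate(
    LIBRARY_BOOKS_TUPLES, BRANCH_DISTINCT_VALUES
)

# Assumption: one CheckedOut tuple per checked-out book.
overdue_checkouts = percent_selection_estimate(
    CHECKED_OUT_TUPLES, OVERDUE_PERCENT
)

print(f"Estimated LibraryBooks tuples per branch: {books_per_branch:,}")
print(f"Estimated overdue CheckedOut tuples:      {overdue_checkouts:,}")
```

Comparing these estimated intermediate sizes against the unfiltered relation sizes indicates which selections shrink the join inputs the most, which is the kind of reasoning parts a.2 and a.3 ask for.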
