Recruiter Screening Questions
Question #1: How would you join Tables A and B together in order to retrieve names with no matches in the opposing tables?
Table A
Name
ID
Jon
1
Jacob
2
Jingleheimerscmidt
3
Table B
Name
ID
Paul
1
Jacob
2
Jingleheimerscmidt
3
Question #2: From this table we need to keep the newest version of each record using date_added, but would like to keep the oldest this_flag value. All fields, minus date_added and this_flag, are used to determine the uniqueness of a record. How would you go about doing this and what would the end result look like?
Date_Added
this_flag
Name
DOB
ID
May 1st , 2015
Y
Jingleheimerscmidt
19901002
3
May 1st , 2015
N
Jingleheimerscmidt
19901002
3
May 5th, 2015
Y
Jon
19901001
1
May 1st , 2015
N
Jon
19901002
1
May 1st, 2015
Y
Jacob
19901001
2
May 5th, 2015
N
Jingleheimerscmidt
19901001
3
May 1st, 2015
Y
Jingleheimerscmidt
19901001
3
Question #3: For the table given in Question #2, can you give an example fixed width file layout for this table? Also, do the same for XML and CSV.
Question #4: There are 2 fields coming in on a file and we need to verify that the values populated are valid, without explicit direction from business. The two fields are DOB (YYYYMMDD) and SSN (000000000).
Step 1: List the rules you would create to ensure those fields are properly validated.
Step 2: Provide code to create those rules in whatever language you are most comfortable with (though SQL preferred for verification reasons).
Question #5: What is the difference between rebasing and merge in Git?
Question #6: You have just made a commit and found critical issues with the new version, how do you restore back to the previous version?
Question #7: Your team has been working on a new release, you are being tasked with combining and merging all the changes for the next release. Based on the following graph, what series of steps would you take to get to Master 2.0 while including all branches.
Question #8: We would like to migrate our sales report to run on the Azure Computing network, the effort to run these reports is as follows:
Schedule
Regions
Expected Volume
Expected Load
Daily
All
At least 15,000 Transactions
Low
Weekly (Friday 11PM)
North America, Asia
60,000 to 150,000 Transactions
Low but varies heavily
Monthly (Last Day of Month)
All
400,000 to 800,000 Transactions
High with high variation
Bi-Weekly (Wednesday Update to Management)
North America
At least 250,000 Transactions
Low
Quarterly (1st Day of Quarter)
All
Millions or more
Very High
What kind of Computing Solution would you recommend to use to allow for these report to run with an acceptable level of cost and performance, explain your reasoning