Fixup/oracle dates #810

detule · 2024-05-28T01:28:17Z

Fixes long-standing Oracle issues with using batch sizes greater than one when writing to tables with DATE and TIMESTAMP fields.

See, for example, #349, #350, and #391.

simonpcouch

I'm unable to push this around locally before the upcoming release, but these changes feel like a step forward to me!

R/driver-oracle.R

tests/testthat/test-driver-oracle.R

simonpcouch · 2024-05-28T13:24:15Z

R/driver-oracle.R

- dbGetQuery(conn, query)
+ res <- dbGetQuery(conn, query)
+
+ res$data_type <- as.numeric(res$data_type)


Do we need to revert this column back to another type after processing it?

detule · Jun 2, 2024

Hey Simon:

Good question. I think the query can (and does) return all NULLs for that field, which in turn gives a column full of NA_character_ in the data frame.
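To illustrate the point above, here is a minimal sketch using a toy data frame as a stand-in for the metadata query result (the column values are assumptions, not real driver output). It shows that a column made up entirely of character NAs converts cleanly to numeric NAs with as.numeric():

```r
# Toy stand-in for the metadata query result; NOT the real driver output.
res <- data.frame(
  name = c("datetime", "date", "integer"),
  # Assumption: an all-NULL SQL column arrives in R as character NAs.
  data_type = NA_character_,
  stringsAsFactors = FALSE
)

class(res$data_type)            # character before conversion

# Converting character NAs to numeric yields numeric NAs, with no warning.
res$data_type <- as.numeric(res$data_type)

class(res$data_type)            # numeric after conversion
all(is.na(res$data_type))       # every value is still missing
```

Since the column holds nothing but missing values, the conversion loses no information.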


hadley · 2024-06-04T17:57:31Z

R/driver-oracle.R

+ res[res$field.type == "DATE", c("data_type", "column_size")] <- c(91, 6)
+ res[grepl("TIMESTAMP", res$field.type), c("data_type", "column_size")] <- c(93, 16)


This is a very minor quibble but I don't trust this sort of cross-column subset-assignment, so I'd prefer to see something more like:

res$data_type[res$field.type == "DATE"] <- 91
res$column_size[res$field.type == "DATE"] <- 6

I suspect that might also eliminate the need for as.numeric(res$data_type) above.
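One way to see why the cross-column form can be fragile is a small sketch on a toy data frame (hypothetical values, not the driver's actual result). When more than one row matches, base R fills the selected block column by column, so c(91, 6) may not land where intended:

```r
# Toy data with TWO DATE rows; hypothetical, for illustration only.
res <- data.frame(
  field.type  = c("DATE", "DATE", "NUMBER"),
  data_type   = NA_real_,
  column_size = NA_real_,
  stringsAsFactors = FALSE
)

# Cross-column subset-assignment: the value vector is recycled into a
# 2 x 2 block and filled column-wise, not row-wise.
cross <- res
cross[cross$field.type == "DATE", c("data_type", "column_size")] <- c(91, 6)
print(cross)

# Per-column assignment: each column receives exactly one value.
per_col <- res
is_date <- per_col$field.type == "DATE"
per_col$data_type[is_date] <- 91
per_col$column_size[is_date] <- 6
print(per_col)
```

With a single matching row the two forms agree, which is why the cross-column version can look correct in a small test table.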

detule · Jun 5, 2024

Thanks. Done; let me know if that's not what you had in mind.

I think the as.numeric may still be needed, but happy to update if you guys think there's a better way.

Browse[1]> res <- dbGetQuery(conn, query)
Browse[1]> str(res)
'data.frame': 5 obs. of 17 variables:
 $ name                   : chr "datetime" "date" "integer" "double" ...
 $ field.type             : chr "TIMESTAMP(6)" "DATE" "NUMBER" "BINARY_DOUBLE" ...
 $ table_name             : chr "test_table" "test_table" "test_table" "test_table" ...
 $ schema_name            : chr "SA" "SA" "SA" "SA" ...
 $ catalog_name           : chr NA NA NA NA ...
 $ data_type              : chr NA NA NA NA ...
 $ column_size            : num NA NA NA NA 255
 $ buffer_length          : num 11 16 40 8 255
 $ decimal_digits         : num 6 NA 0 NA NA
 $ numeric_precision_radix: chr NA NA NA NA ...
 $ remarks                : chr NA NA NA NA ...
 $ column_default         : chr NA NA NA NA ...
 $ sql_data_type          : chr NA NA NA NA ...
 $ sql_datetime_subtype   : chr NA NA NA NA ...
 $ char_octet_length      : num 0 0 0 0 255
 $ ordinal_position       : num 1 2 3 4 5
 $ nullable               : num 1 1 1 1 1
Browse[1]> isDate <- res$field.type == "DATE"
res$data_type[isDate] <- 91
res$column_size[isDate] <- 6
isTimestamp <- grepl("TIMESTAMP", res$field.type)
res$data_type[isTimestamp] <- 93
res$column_size[isTimestamp] <- 16
Browse[1]> str(res)
'data.frame': 5 obs. of 17 variables:
 $ name                   : chr "datetime" "date" "integer" "double" ...
 $ field.type             : chr "TIMESTAMP(6)" "DATE" "NUMBER" "BINARY_DOUBLE" ...
 $ table_name             : chr "test_table" "test_table" "test_table" "test_table" ...
 $ schema_name            : chr "SA" "SA" "SA" "SA" ...
 $ catalog_name           : chr NA NA NA NA ...
 $ data_type              : chr "93" "91" NA NA ...
 $ column_size            : num 16 6 NA NA 255
 $ buffer_length          : num 11 16 40 8 255
 $ decimal_digits         : num 6 NA 0 NA NA
 $ numeric_precision_radix: chr NA NA NA NA ...
 $ remarks                : chr NA NA NA NA ...
 $ column_default         : chr NA NA NA NA ...
 $ sql_data_type          : chr NA NA NA NA ...
 $ sql_datetime_subtype   : chr NA NA NA NA ...
 $ char_octet_length      : num 0 0 0 0 255
 $ ordinal_position       : num 1 2 3 4 5
 $ nullable               : num 1 1 1 1 1

This is partly a consequence of the silly way the data_type column is formulated in the query (always NULL). We can probably do something better to make sure that the column comes back as numeric, but there is also some value in keeping the query as close as possible to the original SQLColumns implementation for the OEM Oracle driver.
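The coercion seen in the session above can be reproduced on plain vectors. This is a minimal sketch with made-up values (the vector names are hypothetical): assigning a number into a character vector stores it as a string, while converting first keeps the column numeric.

```r
# Hypothetical stand-ins for the data_type column; not real driver output.
is_timestamp <- c(TRUE, FALSE, FALSE)

# Case 1: the column is still character (all NA), as returned by the query.
chr_col <- rep(NA_character_, 3)
chr_col[is_timestamp] <- 93     # the number is coerced to the string "93"
str(chr_col)

# Case 2: convert to numeric first, then assign.
num_col <- as.numeric(rep(NA_character_, 3))
num_col[is_timestamp] <- 93     # stays numeric
str(num_col)
```

This matches the `chr "93" "91" NA NA ...` result shown for data_type, and is one reason the up-front as.numeric() conversion may still be useful.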

detule added 3 commits May 28, 2024 01:23

oracle: Ability to batch-write dates and datetimes.

b0dd680

Add NEWS entry

62113f3

fixup: remove todos

5a6d5ef

detule requested review from simonpcouch and hadley May 28, 2024 01:30

simonpcouch approved these changes May 28, 2024

detule and others added 2 commits June 2, 2024 12:02

Update tests/testthat/test-driver-oracle.R

0f35f0f

Co-authored-by: Simon P. Couch <[email protected]>

code-review: Better explanation

d120937

hadley approved these changes Jun 4, 2024

code-review: style

6e5facf

detule mentioned this pull request Jun 6, 2024

Oracle - dbWriteTable not working with date and timestamp #813

Closed

Merge branch 'main' into fixup/oracle_dates

2c81e0f

detule merged commit 3b65fb2 into r-dbi:main Jun 17, 2024
16 of 17 checks passed
