Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 607207
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 13, 20262026-05-13T17:20:09+00:00 2026-05-13T17:20:09+00:00

note: this question and the following answers refer to data.table versions < 1.5.3; v.

  • 0

note: this question and the following answers refer to data.table versions < 1.5.3; v. 1.5.3 was released in Feb 2011 to resolve this issue. see more recent treatment (03-2012): Translating SQL joins on foreign keys to R data.table syntax


I’ve been digging through the documentation for the data.table package (a replacement for data.frame that’s much more efficient for certain operations), including Josh Reich’s presentation on SQL and data.table at the NYC R Meetup (pdf), but can’t figure this totally trivial operation out.

> x <- DT(a=1:3, b=2:4, key='a')
> x
     a b
[1,] 1 2
[2,] 2 3
[3,] 3 4
> y <- DT(a=1:3, c=c('a','b','c'), key='a')
> y
     a c
[1,] 1 a
[2,] 2 b
[3,] 3 c
> x[y]
     a b
[1,] 1 2
[2,] 2 3
[3,] 3 4
> merge(x,y)
  a b c
1 1 2 a
2 2 3 b
3 3 4 c

The docs say “When [the first argument] is itself a data.table, a join is invoked similar to base::merge but uses binary search on the sorted key.” Clearly this is not the case. Can I get the other columns from y into the result of x[y] with data.tables? It seems like it’s just taking the rows of x where the key matches the key of y, but ignoring the rest of y entirely…

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-13T17:20:09+00:00Added an answer on May 13, 2026 at 5:20 pm

    You are quoting the wrong part of documentation. If you have a look at the doc of [.data.table you will read:

    When i is a data.table, x must have a
    key, meaning join i to x and return
    the rows in x that match
    . An equi-join
    is performed between each column in i
    to each column in x’s key in order.
    This is similar to base R
    functionality of sub- setting a matrix
    by a 2-column matrix, and in higher
    dimensions subsetting an n-dimensional
    array by an n-column matrix

    I admit the description of the package (the part you quoted) is somewhat confusing, because it seems to say that the “[“-operation can be used instead of merge. But I think what it says is: if x and y are both data.tables we use a join on an index (which is invoked like merge) instead of binary search.


    One more thing:

    The data.table library I installed via install.packages was missing the merge.data.table method, so using merge would call merge.data.frame. After installing the package from R-Forge R used the faster merge.data.table method.

    You can check if you have the merge.data.table method by checking the output of:

    methods(generic.function="merge")
    

    EDIT [Answer no longer valid]: This answer refers to data.table version 1.3. In version 1.5.3 the behaviour of data.table changed and x[y] returns the expected results. Thank you Matthew Dowle, author of data.table, for pointing this out in the comments.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

Note: This question is related to an advice rather than an issue. I would
Note: In this question I'm using the term autocomplete (or iterative search) to refer
Note: This question was influenced by this answer . The following is valid C
Note: I'm posing this question to answer it myself, but other answers are welcome.
(Note: this questions and its answers are from 2011 and show how the world
Note: this question has nothing to do with Knockout.js, but it's about the selectedOptions
Note this question was originally posted in 2009, before C++11 was ratified and before
Note: This question has broadened in scope from previous revisions. I have tried to
Note: this question is related to this one , but two years is a
Note: This is similar to this question but it is not the same. I

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.